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Preface 



The present volume is the first of two planned to provide state-of-the-art expert 
reviews of central topics in modern natural products chemistry and secondary 
metabolism. Many scientists not directly involved in these areas still view the field 
of natural products as focused primarily on the isolation, structure elucidation, 
and cataloging of new compounds (i.e. “grind and find”), or on their chemical 
synthesis. However, two revolutions in experimental techniques since the early 
1950’s have completely transformed the understanding of chemical and biological 
relationships between highly diverse natural products. 

The first was the use isotopes to label precursors and follow the fate of 
key atoms during biochemical transformations to a final product. Advances in 
methodology that began in the mid-1970’s, especially using NMR to detect stable 
isotopes, dramatically expanded the level of mechanistic detail available to study 
in vivo conversions of secondary metabolites. 

The second revolution started in the late 1980’s as researchers developed 
methods to identify, purify and genetically manipulate individual enzymes re- 
sponsible for the intricate steps leading to complex natural products. This has 
expanded the field from building an encyclopaedia of natural materials to being 
able to initiate control of biochemical pathways to secondary metabolites. Recent 
advances suggest it may soon be possible to rationally manipulate biochemical 
pathways in vivo to rapidly produce almost any target molecule, including non- 
natural variants, in substantial quantity. Since a host of important pharmaceutical 
agents or their precursors are produced by fermentation, the increased under- 
standing and control of biosynthesis will lead not only to improved production of 
natural products, but also to an arsenal of modified derivatives for combinatorial 
approaches in drug discovery. 

In the current volume, Tom Simpson begins with a review of the modern iso- 
topic techniques useful for elucidation of the overall sequence of events in the 
construction of a secondary metabolite. The examples chosen to illustrate the 
applications are primarily polyketides, but the techniques can be applied to any 
class of natural product. In the second chapter, /im Staunton and Barrie Wilkinson 
explain the individual steps by which non-aromatic polyketides are assembled, 
and illustrate the genetic and biochemical studies that have recently led to a 
detailed knowledge of the remarkable machinery that manufactures complex 
antibiotics like erythromycin from short chain precursors such as propionate. In 
the third chapter, Tadhg Begley and co-authors describe the new genetic and 
enzymatic investigations that provide our current mechanistic understanding of 
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biosynthesis of more than a dozen vitamins and enzyme cofactors. This review 
also identifies limitations in our current grasp of steps involved in the formation 
of these crucially important coenzymes. The final chapter by Alan Battersby and 
Finian Leeper provides a detailed overview of the formation of vitamin Bi 2 > one 
of the “pigments of life” and the most complex natural product to be found in a 
wide variety of organisms. In this, as in the other chapters, a wealth of fascinating 
chemistry is uncovered, which leaves one marvelling at the elegance and efficiency 
of biosynthetic processes. 

Finally we would like to point out that the authors were asked to review the 
developments in their field over the last five to ten years and particularly their own 
contributions to the field. If any older work or work from other laboratories which 
has been significant in the development of the field has not been mentioned, it is 
for this reason only. 

Finian J. Leeper, Cambridge October 1997 

John C. Vederas, Edmonton 
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The application of stable isotope labelling methods to the study of biosynthetic pathways of 
secondary metabolites is illustrated by describing studies on the biosynthesis of polyketide 
and terpenoid metabolites of fungal origins. The use of doubly '^C-labelled precursors pro- 
vides information on the mode of assembly, cyclisation and rearrangements of polyketide and 
terpenoid carbon skeletons. More detailed information on the oxidation levels of biosynthetic 
intermediates and oxidative and reductive modifications occurring during later stages of the 
pathway can be obtained by the use of precursors labelled with '^C in conjunction with 
and '*0. These methods are illustrated with reference to studies on the mechanistic and 
stereochemical details of polyketide chain assembly and on compounds of mixed poly- 
ketide-terpenoid (meroterpenoid) origins. Fluorine labelled precursors can provide informa- 
tion on key steps as well as producing interesting analogues of the natural metabolites. 
Application of labelling in conjunction with other isotopes, e.g. and '*0, is illustrated 
by studies on the formation of 3-nitropropanoic acid. Recent work on the enzymes of poly- 
ketide biosynthesis also makes use of multiple stable isotope labelling to assist in the elucida- 
tion of 3D protein structure and to obtain information on the mode of action of some of these 
enzymes. 

Keywords: Stable isotopes, fluorine, polyketides, fungi, proteins. 
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1 

Introduction 

The study of biosynthetic pathways received a major impetus in the early 1970s 
with the advent of pulsed Fourier-transform NMR spectrometers which greatly 
facilitated the routine determination of NMR spectra of small amounts of 
natural products. In addition, precursors enriched with and other stable iso- 
topes, e.g. ^H, were becoming more readily available. These were very 

timely and synergistic developments because structures of natural products 
were beginning to be determined increasingly by spectroscopic, mainly NMR, 
methods with little or no recourse to degradative chemistry. In contrast to stable 
isotopes, classical radioisotope methods necessitated extensive degradative 
routes to locate precisely the position of incorporation of isotopic label. In addi- 
tion, molecules of interest were of increasing structural complexity and were 
often available in very limited amounts. These problems were largely overcome 
by and other stable isotope labelling methods. The crucial advance was to 
bring together once again biosynthetic techniques with methods of structure 
elucidation which complemented each other. 

The basic methodology and principles of NMR-based elucidation of isotopic 
labelling patterns have been extensively reviewed [1-6] and need not be detai- 
led here. However, it is worth emphasising that the use of singly ^^C-labelled pre- 
cursors does not give any information that could not, in principle, be gained by 
classical radioisotope methods; they merely (!) made easier the determination of 
complete labelling patterns. The really significant gains come from the use of 
precursors multiply labelled with or with in conjunction with ^H, 
and ^®N. The detection of these combinations of isotopic labels by either 
spin-spin coupling and/or isotopically shifted signals in the NMR spectra of the 
enriched metabolites allowed the incorporation of whole biosynthetic units to 
be elucidated for the first time. The biosynthetic unit could be a single bond or, 
indeed, a multi-atom unit. Thus the integrity of, for example, carbon-carbon, 
carbon-hydrogen or carbon-oxygen bonds during a complex metabolic pathway 
could be tested. The course of skeletal and other rearrangements could be traced 
and, most importantly, the biosynthetic origin of hydrogen, oxygen and nitrogen 
could be determined and the oxidation levels of the intermediates in the 
pathway established by indirect methods. This paved the way for the formula- 
tion and testing of ideas on, for example, processive pathways of polyketide bio- 
synthesis [7]. 

This article will describe some of the results which have come from the appli- 
cation of these methods using examples mainly drawn from our studies on poly- 
ketide and terpenoid metabolites of fungal origin. These will be extended with 
examples of work in other laboratories which the author considers to be parti- 
cularly effective or interesting applications of stable isotope methods. The 
article will conclude with recent applications of stable isotopes to some of our 
work on the enzymology of polyketide biosynthesis to emphasise that this new 
and burgeoning phase of biosynthetic studies brings new and equally effective 
opportunities for the applications of stable isotope labelling to the study of 
natural product biosynthesis. 
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2 

Applications of Double Labelling 

Amongst the earliest applications of labelling was the use of doubly labelled 
[1, 2- ^^€ 2 ] acetate to trace the mode of incorporation of intact acetate units into 
a wide range of metabolites. This has been one of the major recent developments 
in biosynthetic methodology and permits information to be obtained which 
would have been impossible or at best extremely difficult to obtain by classical 
radio-isotope labelling techniques. The basic concept can be illustrated by a 
model polyketide system (Fig. 1). 

If we consider a molecule of acetate in which both carbons are entirely 
([1, 2 “^^C 2 ] acetate), it contains two adjacent nuclei of spin 1/2 and so they will 
couple to each other. If this acetate molecule is incorporated intact into a meta- 
bolite then, in any individual molecule, those pairs of carbons derived from an 
originally intact acetate unit must necessarily both be enriched simultaneously 
and so will show a mutual coupling. Thus if C-1 is enriched, then C-2 

must also be enriched. In the resultant NMR spectrum, the natural ab- 
undance signal is flanked by coupling satellites (Fig. lb). By analysing 

the coupling patterns, information is obtained on the way in which the precursor 
molecules are assembled on the enzyme surface and on the way the polyketide 
chain folds up prior to condensation and cyclisation. If at any stage in the 
biosynthesis the bond between two carbons originally derived from an intact 
acetate unit is broken, then the coupling is lost and these carbons appear 

simply as enhanced singlets, as shown for C-3 and C-4 in Fig. 1 c. In this way 
bond cleavage and rearrangement processes occurring during biosynthesis can 
be detected. 
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Fig.l a-c. Simulated proton noise decoupled NMR spectra of a polyketide-derived moiety: 
a at natural abundance; b enriched from [ 1 , 2“*^C2]acetate; c after cleavage or rearrangement 
of an originally intact acetate unit 
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Couplings are generally between 30 and 80 Hz and increase in pro- 
portion to the amount of “s” character of the atoms in the bond. Hence sp^-sp^ 
bonds are typically 35 Hz; sp^-sp^ are 45 Hz, and sp^sp^ are 60 Hz. Substitution 
on one or both atoms hy oxygen increases the size of the coupling. Usually the 
couplings will be sufficiently different in magnitude to enable the pairs of 
coupled carbons to be matched up. In addition, if the spectrum has heen un- 
ambiguously assigned it will be apparent which carbons are mutually coupled. 

It should be noted that the labelled precursor is normally highly diluted by 
the endogenous pool of unlabelled acetate and so no coupling is ob- 

served between pairs of atoms derived from different acetate units. In some 
cases such “extra” inter-acetate couplings may be observed and in a few ex- 
ceptional cases where little or no dilution by endogenous acetate occurs, the 
further couplings which result can make interpretation of the spectra 

problematic. This problem can usually be overcome by the simple expedient of 
diluting the labelled precursor 2- 3-fold with unlabelled material prior to incor- 
poration 

The contiguous double labelling technique has been used extensively. This is 
partly because it extends the permissible dilution factor to at least 2000, as small 
coupling satellites can be observed with more certainty than the corres- 
ponding enrichment from a singly labelled precursor. However, its success is 
largely because of the extra information obtainable in respect of bond cleavages, 
rearrangements and symmetrical elements of intermediates. Some applications 
of the technique are illustrated below. 

Incorporation of [1, 2 -^^€ 2 ] acetate is invaluable in distinguishing among 
alternative foldings of linear polyketide or terpenoid precursors prior to cyclisa- 
tion and subsequent modifications. Thus, of the three possible foldings of the 
heptaketide precursor of herqueichrysin (1), a phenalenone metabolite of 
Penicillium herquei, the specific folding shown in Scheme 1 was unambiguously 
established [8] from the observed couplings in the NMR spectrum of 

the enriched metabolite (Fig. 2). 

The labelling pattern of paniculide (2) derived from [1, 2 -^^€ 2 ] acetate in 
callus cultures of Andrographis paniculata demonstrates that the folding of 
farnesyl pyrophosphate in the biosynthesis of (2) must be as shown in Scheme 2 
[9]. This is an example where the increased sensitivity associated with doubly 
labelled acetate was essential. 

More recently, we have determined the labelling pattern from incorporation 
of [1, 2 -^^€ 2 ] acetate into astellatol (3), a novel sesterterpenoid metabolite of 



c, -^[O] 




Scheme 1 
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Fig. 2. NMR spectrum of [1, 2-'^C2]acetate enriched herqueichrysin (as its triacetate) 
determined at 67.89 MHz. Note the severe overlap of the coupled signals in the aromatic 
region and the second-order couplings arising from the similarity of the chemical 

shifts of C-8 and C-9 



Me-COoNa 



Scheme 2 




Aspergillus stellatus. This supports its biosynthesis via cyclisation of geranyl- 
farnesyl pyrophosphate followed by rearrangement as shown in Scheme 3 [10]. 
Surprisingly, the folding pattern must be different from that involved in the bio- 
synthesis of stellatic acid (4), a previously isolated sesterterpene metabolite of 
A. stellatus [11]. 

The use of ^^C 2 -acetate labelling to detect the involvement of symmetrical 
intermediates has found extensive use in biosynthetic studies. Thus, as shown in 
Scheme 4, incorporation of [1, 2- ^^€ 2 ] acetate into ravenelin (7) in cultures of 
Helminthosporium ravenelii resulted in the observation of a randomisation of 
labelling in ring-C which confirmed the predicted involvement of a symmetri- 
cal benzophenone intermediate (6), itself derived from cleavage of an octaketide- 
derived anthraquinone, presumably helminthosporin (5) [12]. 

The more complex xanthone metabolite, tajixanthone (11), was similarly 
shown to be formed via a symmetrical intermediate. Incorporation of [1, 
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Scheme 4 
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2-^^C2]- and [^HjJacetate gave the results summarised in Scheme 5 [13]. The 
absence of label on C-25 and C-5 indicated that cleavage of an anthraquinone 
rather than an anthrone octaketide intermediate occurred and that decarboxy- 
lation of the octaketide precursor occurs after cyclisation and aromatisation. 
The observed scrambling of couplings in ring C implies the involvement 

of a symmetrical benzophenone intermediate (8) which in turn means that ring 
cleavage of the anthraquinone precursor must precede introduction of the 
C-prenyl residue. The specificity of the labelling in the dihydropyran ring, how- 
ever, suggests that it is formed from an 0-prenylaldehyde intermediate (9) by a 
concerted “ene” reaction. 

In vitro studies of the ring closure reaction on closely related xanthone 
models result in a ds-orientation of the hydroxyl and isopropenyl substituents 
which can be explained by steric clash between the ketone and xanthone 
carbonyls in the potential intermediate ( 10) which can be avoided if the “ene” 
reaction occurs on the benzophenone intermediate (9). This observation was 
supported by the isolation of previously known Aspergillus rugulosus meta- 
bolites, e.g. arugosin A (12), as co-metabolites of tajixanthone in Aspergillus 
variecolor. The involvement of anthraquinone intermediates was subsequently 




Scheme 5 
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established when [meth^Z-^Hjlchrysophanol (13) was shown to be a specific 
precursor for tajixanthone. NMR analysis showed that specific incorporation 
of label into the aromatic methyl group of (11) had occurred [14]. Further in- 
formation on the mechanism of xanthone ring closure was obtained by analysis 
of the NMR spectrum and mass spectrum of tajixanthone isolated from A. 
variecolor grown under an atmosphere containing The intensities of the 
isotopically shifted signals for C-1, C-10 and C-11 are half those observed 
elsewhere in the molecule, e.g. C-7, C-25, C-15 and C-16, again consistent with 
the pathway outlined in Scheme 5. 







Scheme 6 
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Further modifications of this anthraquinone ring cleavage pathway can give 
rise to structurally and biosynthetically more complex metabolites. Cercospora 
beticola toxin which was found to be a 2 : 2 complex of two magnesium ions with 
two molecules of (14) [15] and the xanthoquinodins A1 (15) and B1 (16), anti- 
coccidial metabolites of Humicola sp. OF-888 [16] have labelling patterns from 
incorporation of [^^€ 2 ] acetate consistent with a heterodimer origin from alter- 
native, and as yet mechanistically obscure, couplings of an anthraquinone ( 18), 
and the derived xanthone moiety (19), as indicated in Scheme 6. In the case of 
these metabolites, the acetate incorporation pattern shows randomisation of 
labelling in the xanthone ring derived from the A-ring of the anthraquinone 
(18). In xanthoquinodin A3 ( 17), it is evident that yet further ring cleavage of the 
xanthone-derived moiety has occurred. 

In marked contrast to ravenelin and tajixanthone, no randomisation of [I, 
2 -^^C 2 ] acetate labelling was observed on incorporation into sterigmatocystin 
(20) in Aspergillus variecolor, ruling out the involvement of a proposed sym- 
metrical benzophenone intermediate [17]. 




A further illustration of the use of ^^C-labelled acetates to detect symmetrical 
intermediates is provided by LL-D253a, a chromanone metabolite of a number 
of Phoma species. Incorporation of [ 1, 2-^^C2]acetate into LL-D253a (21 ) in Phoma 
pigmentivora indicated that the molecule was assembled from intact acetate 
units as shown in Scheme 7 [18]. Thus the molecule must be formed from the 



■ • 

M6'"C02Na 

■ ; □ 
• :0 

Scheme 7 



OMe O 




condensation of two separate preformed polyketide precursors. A particularly 
surprising feature of this study was the observation of partial randomisation of 
label from incorporation of singly ^^C-labelled acetates between C-10 and C-11 
in the hydroxyethyl side chain. This is an unusual example where singly ^Re- 
labelled acetate provided more revealing information than [1, 2- ^^€ 2 ] acetate! 
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Scheme 8 



The observed randomisation of labelling in 80% of the molecules is accounted 
for by the formation of a symmetrical cyclopropyl intermediate (22) as shown in 
Scheme 8. This intermediate may undergo hydrolytic ring opening at either the 
a or p carbon. According to this scheme, 20% of the molecules not undergoing 
randomisation should have the 11 -hydroxyl derived from the atmosphere and, 
in accord with this, fermentation in an ^*©2 atmosphere resulted in an iso- 
tope shift being observed on the resonance due to C-11 in the NMR 
spectrum, the intensity of the shifted signal being approximately 20 % that of the 
unshifted signal. The mass spectrum showed an M -I- 2 peak which mass matched 
for an ion containing one atom which was approximately 20 % of the mole- 
cular ion. It is not clear whether the randomisation occurs in vivo or in vitro, but 
it is noteworthy that an exactly analogous randomisation was recently observed 
for phomalone (23), a butyrophenone metabolite of the fungus Phoma exigua 
[19]. 
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An essential, and often overlooked, feature of these studies is the necessity to 
have a completely unambiguous assignment of the NMR and NMR 
spectra of these molecules and this is often the major and most demanding part 
of the work. An interesting facet of the work on LL-D253a is that the initial 
assignment studies of the NMR spectrum revealed that the structure of the 
molecule which had been assigned independently as (24) by three different 
groups was in fact wrong! Detailed analysis of the long-range couplings 

in the NMR spectrum led to the revision of the structure which was sub- 
sequently proved by total synthesis of both the correct and the previously assig- 
ned structures [20]. 



OMe O 




OH O 




Incorporations of [^^€ 2 ] acetate have also enabled the detection of bond 
cleavage and skeletal rearrangement processes occurring during the bio- 
synthesis of a wide range of metabolites. One of the earliest examples was in the 
biosynthesis of multicolic acid (26) and related tetronic acid metabolites of 
Penicillium multicolor [21]. The observation of couplings and, more 

significantly, their absence on the resonances of certain carbons led to the 
proposal that the tetronic acids were bio synthesised via oxidative cleavage of an 
aromatic intermediate as shown in Scheme 9. These proposals were subsequent- 
ly confirmed [22] by the incorporation of 6-pentyl-resorcylic acid ( 25) and from 
^®0-labelling studies [23]. 





Similar labelling studies on aspyrone (27), a metabolite of Aspergillus melleus, 
gave the labelling pattern shown in Scheme 10 [24] which suggested that its bio- 




Scheme 10 



27 
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synthesis was either via a ring cleavage pathway or by rearrangement. The 
rearrangement pathway shown was supported by the observation of a two-bond 
coupling of 6.2 Hz between C-2 and C-8 in the NMR spectrum of 
[^^€ 2 ] acetate-enriched aspyrone. This was the first observation of such a 
coupling in biosynthetic studies. Further work which has led to a complete 
delineation of the biosynthesis of aspyrone is discussed below. 

An extreme example of diverse structures which can arise by extensive and 
varied modifications of a single polyketide-derived metabolite is provided by 
the phytotoxic spiciferones (28) -(30), spiciferinone (31) and spiciferin (32) 
produced by the fungus Cochliobolus spicifer responsible for leaf spot disease in 




Me-C02Na 
^ methionine 



O 




Me 



28 


R, = Me, R 2 = Me 


29 


R, = CH 2 OH, R 2 = Me 


30 


R, = Me, Rj = CH 2 OH 




0 






Me^ 


0 

0 



^e ^ 

Me 




A ▲ 



wheat. Despite their markedly different carbon skeletons, they were proposed to 
arise from a common polyketide precursor [25]. Preliminary biosynthetic ex- 
periments were hindered by the low yields of the metabolites, but on addition of 
methionine the yield was increased 4-fold, allowing incorporation studies with 
[^^€ 2 ] acetate and [methy/-^^C] methionine. The labelling results are consistent 
with cyclisation of a single di-C-methylated hexaketide chain followed by a 
series of retro-aldol reactions and oxidative cleavages as outlined in Scheme 11. 




Scheme 11 



32 
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3 

Assembly of Highly Reduced Polyketide Metabolites: 

'K, and ^^0-Labelling 

Despite their enormous structural diversity, polyketide metabolites are related 
by their common derivation from highly functionalised carbon chains whose 
assemblies are controlled by multifunctional enzyme complexes, the polyketide 
synthases (PKSs) which, like the closely related fatty acid synthases, catalyse 
repetitious sequences of decarboxylative condensation reactions between 
simple acyl thioesters and malonate, as shown in Fig. 3 [7]. Each condensation 
is followed by a cycle of modifying reactions; ketoreduction, dehydration and 
enoyl reduction. In contrast to fatty acid biosynthesis where the full cycle of 
essentially reductive modifications normally follow each condensation reduc- 
tion, the PKSs can use this sequence in a highly selective and controlled manner 
to assemble polyketide intermediates with an enormous number of permuta- 
tions of functionality along the chain. As shown in Fig. 3, the reduction sequence 
can be largely or entirely omitted to produce the classical polyketide inter- 
mediate which bears a carbonyl on every alternate carbon and which normally 
cyclises to aromatic polyketide metabolites. On the other hand, the reductive 
sequence can be used fully or partially after each condensation to produce 
highly functionalised intermediates such as the “Reduced polyketide” in Fig. 3. 
Basic questions to be answered are (i) what is the actual polyketide intermediate 
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Fig. 3. The assembly of fatty acids, polyketides and reduced polyketides. The “reduced poly- 
ketide” intermediate would be formed from an acetate starter by five successive condensation 
cycles. The first two cycles are condensations and are followed by condensation-ketor- 
eduction, condensation-ketoreduction-elimination, and finally a full condensation-keto- 
reduction-elimination-enoyl reduction cycle. Thus the overall reaction sequence is A, A, AB, 
ABQABCD 
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assembled by the PKS; (ii) what is the exact sequence of condensation and 
reduction steps catalysed by the PKS; and (iii) what further oxidative, reductive 
and other modifications are required to convert the product of the PKS-cataly- 
sed assembly sequence to the final observed metabolite? 

Much of our current understanding of the polyketide metabolites formed via 
these highly reduced intermediates has come from studies using and 
labelled precursors in conjunction with detailed and NMR analysis of the 
enriched metabolites [5, 6]. 

NMR, despite several inherent disadvantages, has been the nucleus of 
choice in many biosynthetic studies. Its major limitations are mainly as a con- 
sequence of the low magnetogyric ratio and the relaxation behaviour of the 
nucleus. Because it is a quadrupole nucleus (spin 1) and thus very efficiently 
relaxed, the spectral lines are rather broad and this, coupled with the low magne- 
togyric constant and the small chemical shift range for hydrogen nuclei, often 
results in poorly resolved spectra. However, the rapid relaxation and lack of any 
n.O.e. mean that accurate integration of NMR spectra is possible so that the 
relative enrichment at different sites in a metabolite can be accurately assessed. 
Another major advantage is that as a consequence of its low natural abundance 
(0.012%), much greater dilutions are tolerable than in the case of ^^C-labelling: 
a 100% ^H-labelled precursor may be diluted 6000-fold and still result in a doub- 
ling of intensity over the corresponding natural abundance signal. This makes 
^H-labelling particularly suitable for studying the incorporation of advanced 
intermediates on a biosynthetic pathway [2, 3]. 

The inherent lack of resolution in NMR can be overcome by the use of 
isotope-induced shifts in NMR. The use of as a “reporter” nucleus for both 
hydrogen and oxygen represents one of the great advances in biosynthetic 
studies with stable isotopes and makes use of the observation that substitution 
of a proton alpha or beta to a by deuterium causes a change (usually upheld) 
in the chemical shift. Similarly, the presence of alpha to a atom can be 
detected by an upheld shift in the NMR spectrum. These effects are 
summarised in Fig. 4. When the deuterium label is directly attached to a 
nucleus in the precursor molecule, the p.n.d. NMR spectrum of the enriched 
metabolite shows, for carbons which have retained deuterium label, a series of 
signals upheld of the normal resonance. The presence of each deuterium shifts 
the centre of the resonance by 0.3 -0.6 ppm and spin-spin coupling (7cd) Pro- 
duces a characteristic multiplet; hence CD appears (Fig. 4 a) as a triplet, whereas 
CD2 and CD3 would give respectively a quintet and septet. Shifted signals arising 
from carbons which bear no hydrogen suffer reduced signal-to-noise ratio 
caused by poor relaxation and lack of n.O.e. enhancement, a disadvantage of the 
method which is compounded by the multiplicities due to coupling. Deuterium 
decoupling can assist in this by removing the coupling (see Fig. 7 below). 

However, information not obtainable by direct ^H NMR spectroscopy, such as 
the distribution of label as CH2D, CHD2 and CD3 and the integrity of carbon- 
hydrogen bonds during biosynthesis, maybe gained. 

Many of the problems associated with directly attached deuterium are 
avoided by placing the deuterium label two bonds away from the reporter 
nucleus. The isotope shift, although reduced, is still observable, and as jS-hy- 
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Fig. 4a-d. Simulated proton noise decoupled NMR spectra of a polyketide-derived moiety 
from: a [2-'^C, 2-^H3]acetate; b [l-'^C, 2-^H3]acetate; c [l-'^C,'*02]acetate; d '*02 gas 



drogens only contribute markedly to the relaxation of non-protonated 
nuclei, the shifted signals otherwise retain any n.O.e. also experienced by the 
unshifted signals on proton decoupling. As geminal carbon-proton coupling 
constants are generally small, and carbon-deuterium couplings are over six 
times smaller again, the shifted signals are effectively singlets (Fig. 4b), even 
without deuterium decoupling, and this gives a further increase in the signal-to- 
noise ratio compared with the corresponding a-shift experiment. However, 
neither of these methods provides reliable information on the stereospecificity 
of deuterium labelling. Although NMR spectra are disadvantaged by their 
inherently low dispersion and broad lines, they have the advantage of providing 
information on the stereospecificity as well as regiospecificity of labelling. 
NMR, however, does not prove the number of deuteriums incorporated. 

The biosynthetic incorporation of can also be detected by the observation 
of isotope induced shifts in the NMR spectrum, as shown in Fig. 4c,d. 
The may be conveniently introduced via a doubly labelled precursor or by 
growth in an '®02 atmosphere. The resulting shifts are generally not much larger 
than 0.05 ppm. These are very small effects, the same general size as isoto- 
pe shifts and are only readily observed with high field spectrometers. These 
techniques for elucidation of the origins of hydrogen and oxygen provided the 
basis for much of the work described below. 

Colletodiol (38), a macrodiolide containing a 14-membered ring, was 
originally isolated from the plant pathogen Colletotrichium capsid along with a 
number of related macrodiolides including colletoketol (39), which was sub- 
sequently isolated (as grahamimycin A) from culture filtrates of Cytospora sp. 
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ATCC 20502. Incorporation of ^^C-labelled acetates in C. capsid established that 
colletodiol was polyketide-derived and is formed via and Cg hydroxyacids of 
tri- and tetra-ketide origins respectively as shown in Scheme 12 [26] . Depending 
on the exact structures of the intermediates, a number of mechanisms can be 
proposed for the formation of the lactone and 1,2-diol moieties in colletodiol. 
The origins of all the oxygen and hydrogen atoms have been elucidated by 
incorporation of and [l-^^C,^® 02 ]acetates and ^*©2 gas by cultures 

of Cytospora [27]. The labelling pattern in colletodiol is summarised in Fig. 5. 
Interestingly, no isotope-induced shifts could be observed for C-2 or C-8 
in the NMR spectrum of the [l-^^C,^H 3 ]acetate-enriched colletodiol. It is 
known [28, 29] that carbonyl carbons can be poor “reporter” atoms for 
shifts and the presence of label was shown by direct NMR analysis of the 
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Fig. 5. Incorporation of [1-'^C,'*02]-, l-^HjJacetates and '*02 into colletodiol and an 

acyl substitution mechanism for macrolactonisation 



enriched metabolite. The level of incorporation is essentially uniform at all 
the enriched positions except for C-10 where a very low level was observed. 

From these results it can be concluded that the lactone ring formation occurs 
by an acyl substitution mechanism, as shown in Fig. 5 from the thioester inter- 
mediates (34) and (35) to give the macrocyclic triene (36). Examination of 
models suggested that the triene (36) would adopt a conformation similar to 
that observed for colletodiol and in this conformation epoxidation should occur 
from the more accessible 10i?e,lli?e face of the 2-alkene to give the 105,111?- 
epoxide (37) which on hydrolysis would give colletodiol with the correct 
101?, Hi? stereochemistry and the observed origins of the hydroxyl oxygens. 
Both the triene (36) and epoxide (37) have been prepared, specifically ^H-label- 
led at C-10, from [10-^H]colletodiol, itself prepared by reduction of colletoketol 
with sodium borodeuteride [30]. On feeding to Cytospora, both (36) and (37) 
result in specific incorporation of label at C-10 of colletodiol, proving that 
both the triene and the epoxide are specific intermediates. The level of incor- 
poration of was lower than expected from control experiments, which is con- 
sistent with the observed low level of incorporation from ^H-labelled acetate at 
this position and may be due to a facile redox equilibrium between colletodiol 
(38) and colletoketol (39). Subsequent studies have shown the presence of novel 
13-membered macrodiolides bartanol (41) and bartallol as minor co-meta- 
bolites of colletodiol [31]. It may be envisaged that a concerted rearrangement 
of epoxide (37) would produce the aldehyde (40) which on reduction would give 
bartanol directly with the correct absolute stereochemistry. This appears to 
provide the first example of macrolides in which the ring size has been establish- 
ed by a ring contraction process on a preformed macrocycle. The structure of 
bartanol has been confirmed by an enantioselective total synthesis [32]. 

As indicated in Scheme 12, the labelling studies are consistent with formation 
of triene (36) from thioesters (34) and (35) which themselves maybe built up by 
the assembly sequence shown. Diol (33), in which the stereochemistry at C-3 is 
unknown, is proposed as a common intermediate, elimination to give either an 
E or Z alkene leading respectively to (34) or, after further condensation, re- 
duction and elimination, to (35). Experiments to test this sequence with the 
putative intermediates fed in the form of their M-acetylcysteamine (NAC) 
thioesters have yet to be completed, but, significantly, feeding studies in which 
the putative triketide (34) has been fed as the NAC thioester, as the ethyl ester or 
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Fig. 6. '*0-Isotope induced shifts in the 100.6 MHz p.n.d. NMR spectrum of '*02 enriched 
aspyrone (27) 



as the free carboxylic acid have all failed to provide evidence of intact incorpo- 
ration [unpublished results]. The lack of incorporation could be explained by 
the diol (33) being the immediate precursor for the macrolactonisation reaction 
with dehydration to triene (36) occurring after macrocycle formation. 

The development of and labelling experiments led to a re-examination 
of the formation of aspyrone (27). Incorporation of [l-'^C,'®02]acetate and '®02 
gas revealed the surprising result that none of the oxygens were derived from 
acetate, three being derived from the atmosphere and one from the medium 
[33]. As indicated in Fig. 6, the lactone carbonyl carbon C-2 showed isotope 
shifts due to the presence of aerobically-derived in either of the doubly or 
singly bonded oxygens. These results were subsequently verified in one of the 
few reported applications of labelling [6] and NMR in biosynthetic 
studies [34]. Although NMR active (spin 5/2), oxygen 17 is an insensitive nucleus 
which gives broad lines due to quadrupolar relaxation. In addition, acoustic 
ringing in the NMR probe and pulse breakthrough can cause serious interference 
with the recording of the signal and this may result in severe baseline roll which 
may obscure weak signals. The quality of NMR spectra has been greatly 
improved by application of the maximum entropy method [34] which does 
make its use in biosynthetic studies more attractive. 

Incorporation of [2-'^C,^H3]acetate into aspyrone (27) gave the spectrum 
shown in Fig. 7. As may be seen, simultaneous deuterium and proton noise 
decoupling leads to a great simplification of the otherwise uninterpretable 
results for C-7 and C-10. The retention of two acetate-derived hydrogens at C-7 
makes it unlikely that this carbon could have been part of an olefinic bond at any 
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Fig. 7 a,b. a-^H Isotope induced shifts observed in the 100.6 MHz NMR spectra of aspyrone 
(27) enriched from feeding [l-'^C.^HjJacetate to Aspergillus melleus: a with simultaneous 
'H and noise decoupling; b with 'H noise decoupling only 



point in the biosynthetic pathway. To accommodate these results, a pathway 
involving epoxide-mediated rearrangement and ring-closure reactions was 
proposed. This was further modified by a spectacular series of incorporation 
experiments with putative chain assembly intermediates which were syn- 
thesised in the form of their NAC thioesters with either or double labels 
[35-40]. These experiments resulted in a complete delineation of the assembly 
pathway catalysed by the PKS and remain the most complete and successful 
study of its type. Incorporation of jS-hydroxythioester intermediates, e.g. the 
NAC thioester corresponding to (42) [38, 39], established that the ketoreduction 
process occurs with the same absolute stereochemistry as in fatty acid bio- 
synthesis. This led to the proposal that the PKS involved in aspyrone biosynthe- 
sis could be derived from a fatty acid synthase which presumably had lost the 
capacity to carry out enoyl reductions. The dehydrations were shown to occur 
by a syn elimination [40]. The final hexaketide product (43) of the polyketide 
assembly process would be converted to aspyrone by the sequence shown in 
Scheme 13 which includes an analogous epoxide rearrangement to that pro- 
posed for bartanol biosynthesis above. 

While some success has been reported in analogous studies with polyketide 
assembly intermediates in Streptomyces metabolites, e. g. erythromycin [41] and 
tylosin [42], similar experiments on fungal polyketides have been more limited. 
The di- and tetraketide intermediates (44) and (45), variously doubly labelled 
with and as indicated in Scheme 14, have been incorporated into de- 
hydrocurvularin (46) by cultures of Alternaria cineriae [43]. However, in 
contrast to the ease of incorporation of assembly intermediates into aspyrone by 
A. melleus, the experiments in A. cineriae required considerable experimenta- 
tion to optimise the feeding conditions and the use of the jS-oxidation inhibitors. 
The initial experiments [43] depended on the use of UV mutants of A. cineriae 
which had lost the ability to utilise fatty acids and therefore to degrade the fatty 
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Scheme 13 

acid-like labelled precursors. In a subsequent paper [44] it was reported that a 
range of jS-oxidation inhibitors, and in particular 3-(tetradecylthio)propanoic 
acid, were effective in allowing intact incorporation to be observed. Incorpora- 
tions of up to 70% were obtained, including that of ethyl (7 S)-[6, 7-^^C2,hydroxy- 
^®0]-7-hydroxyoct-2-enoate. The NMR spectrum of the enriched dehydro- 
curvularin showed an isotopically shifted doublet for C-4 and a simple 
doublet due to coupling with no isotopic shift for C-5. 




Our own studies on the biosynthesis of monocerin (50), Scheme 16, in 
Dreschlera ravenelii illustrate the problems which commonly arise in this type 
of study. Incorporations of and ^®0-labelled acetates and gas and 

analysis by and NMR showed inter alia that the oxygen atoms attached to 
C-9 and C-1 1 were acetate derived, so that successive ketoreductions occur with 
opposite stereochemistry; that both hydrogens at C-10 are acetate derived, 
consistent with ketoreduction occurring during chain assembly; and that the 
‘extra’ oxygen at C-4 is derived aerobically [45]. These results are consistent with 
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Fig. 8 a, b. 55.28 MHz NMR spectra of monocerin (50) enriched from feeding: a sodium 
[2, 4, 4, 4-^H4]crotonate (53); b [3-^H]-3-hydroxyhexanoic acid NAC thioester (59) 
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the polyketide assembly sequence shown in Scheme 15 to give the heptaketide (47) 
as the product of the assembly phase. This would cyclise and aromatise and finally 
lactonisation of the C-9 hydroxyl onto the thioester would give the dihydroisocou- 
marin (48) as the first PKS-free intermediate. This would be further converted into 
the final metabolite as outlined in Scheme 16, in which the exact sequence of the 
necessary modification steps is unclear. Support for this sequence does come from 
the isolation of the known fusarentin ether (49) as a co-metabolite of monocerin. 
In order to establish the exact sequence, an enantioselective synthesis of the fusa- 
rentins which will allow preparation of the necessary di- and trioxygenated dihy- 
droisocoumarin intermediates doubly labelled with has been developed [46]. 

A number of putative assembly intermediates have been prepared in label- 
led forms for feeding studies to confirm the assembly sequence shown in Sche- 
me 15. However, none of these were incorporated intact by cultures of D. ravene- 
lii, although all were efficiently metabolised by the cultures [54]. The results fell 
into three main categories. The sodium salts (51) -(53) all showed high levels of 

incorporation by NMR, which corresponded exactly to that observed for 
incorporation of [^Hjlacetate (Fig. 8a). The corresponding thioesters (54) -(57) 
gave a similar labelling pattern, but the incorporation levels were considerably 
lower. Interestingly, the thioesters (58) and (59) gave efficient incorporation of 

label at those positions, including the methoxyls, where one would predict 
incorporation of hydrogen by reductive modification (Fig. 8b). It appears there- 
fore that these precursors are broken down by jS-oxidation with concomitant 
enrichment of the NADPH pool, and that the resultant NADPD is immediately 
used in the biosynthetic pathway to monocerin. 

4 

Stereochemistry of Polyketide Assembly in Fungi: and Studies 

Many of the above studies have given invaluable information on the stereo- 
chemical outcome of the ketoreductase and dehydratase catalysed reactions 
occurring during polyketide assembly in fungi. A number of studies of incor- 
poration of [^Hj] acetate have provided indirect information on the stereo- 
chemistry of the final enoyl reductase reaction. Thus label is found at the pro- 

5 positions at C-11 of cladosporin (60) in Cladosporium cladosporoides [47], 
C-7 of dehydrocurvularin (46) and C-8 of antibiotic A2677 IB (61) in Penicillium 
turbatum [48]. In all cases these results indicate that the enoyl reductase of the 
PKS has the opposite stereochemistry to that of the corresponding FAS. In con- 
trast to the above, the labels at C-2' and C-4' of averufin (62) in Aspergillus 
parasiticus show the same stereochemistry as those in the corresponding fatty 
acids [49]. This result is consistent with the proposed [50] intermediacy of a 
hexanoate starter unit from fatty acid metabolism in averufin biosynthesis. 
Recent genetic evidence has indicated that there is a specialised FAS associated 
with the PKS involved in averufin biosynthesis and in some elegant experiments 
it has been shown that exogenous hexanoyl NAC could be effective in substituting 
for the product of the FAS when the gene is disrupted [51]. 

The stereochemistry of the ketosynthase-catalysed condensation of malona- 
te with the enzyme-bound thioester at the start of each elongation cycle has 
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always been presumed to occur with inversion of configuration at malonate as 
has been established for the corresponding condensation in fatty acid bio- 
synthesis. In collaboration with Heinz Floss [unpublished results], we have 
obtained information for this from the incorporation of (2R)- and (2S)- 
acetates into colletoketol (39) and monocerin (50) and subsequent 
NMR analysis of the enriched metabolites. 

Although best known as a radioactive tracer the nucleus has ideal charac- 
teristics for use as an NMR label. As the natural abundance is practically zero 
and the magnetogyric ratio is the highest for any nuclide (28.53 compared to 
26.75 for ^H), NMR spectroscopy is uniquely sensitive by the standard of 
other NMR methods of tracing isotopes. The chemical shift values and coupling 
constants are very close to those of the corresponding NMR spectrum, so 
assignments can be made on this basis. Accurate integration is also possible. 
Tritium NMR has been used in a number of whole cell and enzyme studies [6] 
including study of the incorporation of ^H-labelled valines into cephalosporin C 
in Cephalosporium acremonium [52], and incubation of [^H]porphobilinogen 
with porphobilinogen deaminase from Rhodopseudomonas sp. [53]. 

As indicated in Scheme 17 for the R-acetate, assuming that carboxylation to 
malonate proceeds with retention of configuration, there are four possible 
tritiated products from the condensation step. Thus on subsequent incorpora- 






Application of Isotopic Methods to Secondary Metabolic Pathways 



25 




4.00 3.00 2.00 1.00 



Fig. 9. 533 MHz NMR spectrum of colletoketol (39) enriched from feeding sodium 
(R)-[^H,^H,'H]acetate (50 mCi, 36 mCi mmol"*) to Cytospora sp. ATCC 20502 



tion into the 4- and 12-methylene groups of colletoketol (39) or the 10 -methylene 
group of monocerin (50), all positions where the stereospecific assignments of 
the diastereotopic hydrogens have been made, the stereochemical outcome can 
be deduced once it is known which of the prochiral hydrogens is incorporated 
along with and which is incorporated along with *H. This should be apparent 
from analysis of the p.n.d. NMR spectrum in which one signal should be 
broadened by coupling to ^H. The results for incorporation of (R)-acetate into 
colletoketol in Cytospora sp ATCC 20502 are shown in Fig. 9. Label is in- 
corporated most effectively into the methyl positions, but there is significant 
incorporation into all four methylene hydrogens. Interestingly, there is no ob- 
servable incorporation into the 2- or 8-alkene positions, in contrast to the results 
when [^Hj] acetate is fed. This is difficult to rationalise, but it may be due to a 
“pool” effect, with the small amount of labelled acetate used in the experiment 
being entirely consumed by the PKS for the first two condensations, with un- 
labelled endogenous acetate being used for subsequent condensations. 

Close examination of the spectrum reveals a sharp singlet at <52.54, indicating 

label at the 4-pro-S position with protium adjacent and a broader signal at 
(52.35, indicating at the 4-pro-R position with deuterium adjacent. This is 
consistent with inversion of configuration. The results for the 12-methylene 
group is less clear as both the pro-R and pro-S hydrogen at <52.14 and 1.85 appear 
as sharp singlets. The results for the 10-methylene group in monocerin are, 
however, also consistent with inversion of configuration. 

No discussion of studies on the stereochemical outcome of polyketide bio- 
synthesis would be adequate without mention of the seminal studies of Jordan 
and Spencer [56-58] using chiral malonates to determine the stereospecificity 
of the elimination and enolisation reactions occurring during 6-MSA (66,R=H) 
and orsellinic acid (66, R = OH) biosynthesis. Early experiments by Abell and 
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Scheme 18 



Staunton in which they compared the relative levels of incorporation at the 
3- and 5-positions from and [2-^HJacetates had already indicated that 

the loss of hydrogen was stereospecific, but were not able to define the stereo- 
chemistry [55]. 

Scheme 18 illustrates the proposed stages in 6-MSA biosynthesis in which the 
first and second condensation steps proceed with inversion to give the triketide 
(63). Ketoreduction gives the alcohol (64) and then elimination followed by a 
final malonyl condensation generates the tetraketide (65) which cyclises via an 
intramolecular condensation and enolises to give the aromatic nucleus of (66). 
In the first set of experiments (R)- and (S)-[l-^^C,^H]malonates were incubated 
separately with 6-MSA synthase purified from Penicillium patulum [56]. Isotope 
incorporations were determined by mass spectrometry. All the possible isotope 
patterns for retention or loss of the pro-J? or pro-S hydrogens from C-3 and C-5 
were permutated. Comparison with the actual spectra obtained demonstrated 
that opposite prochiral hydrogens were eliminated. The absolute stereochemi- 
stry was established in an analogous experiment [57] where the chiral malonates 
were incubated with acetoacetyl CoA rather than acetyl CoA. Subsequent mass 
spectral analysis showed that it is the Hjj proton that is retained at C-3 of 6-MSA 
and so it can be deduced that the hydrogen at C-5 must be derived from the 
opposite prochiral hydrogen, Hs. The overall result is summarised in Scheme 18. 
In a recent collaborative study we have synthesised the triketide alcohol (64) as 
its NAC thioester and shown that it is indeed a precursor as, on incubation with 
6-MSA synthase and malonyl CoA, 6-MSA production is observed [unpublished 
results] . Current work is aimed at synthesis of both enantiomers of (64) to study 
the overall stereochemistry of the ketoreduction and elimination reactions. 

A similar stereochemical outcome to that observed for 6-MSA was found 
using orsellinate synthase isolated from Penicillium cyclopium [58]. The hydro- 
gens retained at C-3 and C-5 of orsellinic acid (66, R = OH) are from opposite 
prochiral sites in malonate. This contrasts with a previous study [59] on 
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orsellinic acid carried out in the same way which had concluded that the hydro- 
gen loss was non-stereospecific. The difference may be ascribed to an unfavour- 
able rate of deuterium isotope exchange relative to the rate of orsellinic acid 
synthesis, leading to racemisation in the latter experiment. 

5 

Incorporation of Fluorinated and Other Substrate Analogues 

There are many examples of polyketide metabolites where the common acetate 
starter unit is replaced by other groups, and a general discussion of these is out- 
side the scope of this review. Numerous studies have been carried out on the use 
of analogues of the normal biosynthetic precursors to produce novel structures. 
This type of “precursor-directed biosynthesis” has been the subject of a recent 
comprehensive review [60]. 

A number of recent studies have investigated the use of fluorine as a bio- 
synthetic label. Fluorine is of interest because of its relatively low cost in com- 
parison with ^^C-labelled compounds and the power of NMR spectroscopy 
which is of comparable sensitivity to NMR. Moreover, efficient incorporation 

of a fluorinated precursor and isolation of the resultant novel metabolite is a 
potential method for generation of analogues for biological testing. Fluorine is 
the most electronegative element, resulting in a strongly polarised C-F bond, but 
it has similar steric requirements to hydrogen. Thus, the electronic consequen- 
ces of replacing hydrogen are significant, affecting substrate-enzyme binding 
affinity and, ultimately, the biological activity of the fluorinated analogue [61]. 
The squalestatins, e.g. (68), are novel inhibitors of squalene synthase and hence 
of cholesterol biosynthesis. They were isolated from a Phoma species and ex- 
tensive studies with and labelled precursors established that the 

dioxabicyclooctane core was formed via condensation of a hexaketide precursor 
primed with a benzoyl CoA starter with a Krebs’ Cycle intermediate, presumably 
oxaloacetate [62]. In an attempt to produce novel analogues for biological 
screening, a variety of halogenated, alkylated and other substituted benzoic 
acids were fed to squalestatin-producing cultures. Incorporation was successful 
with a few of the fluorinated analogues, e.g. (69) and (70), and incorporation 
levels of up to 33% were observed [63] as indicated in Scheme 19. 

In another interesting study on the biosynthesis of the antibiotic tetronasin 
(71), various di-, tri- and tetraketide substrate analogues were synthesised [64, 
65] as their NAC thioesters and fed to Streptomyces longisporoflavus [66]. The 
analogues were different by virtue of the fact that one of the natural pendant or 
terminal methyl groups was replaced by an ethyl, iso-propyl or benzyl moiety. In 
addition to this, the a-hydrogen was replaced by fluorine in several of the sub- 
strates. The synthesis of these precursors involved an interesting application of 
electrophilic fluorinating agents to Evans’ oxazolidinone chemistry and serves 
to highlight again the necessity for carrying out demanding synthetic work in 
many biosynthetic studies. 

The introduction of fluorine had several purposes. These were to provide a 
biosynthetic label for detection of incorporation by ^®F NMR; to produce struc- 
tural analogues of the natural metabolite; and specifically in this case to inhibit 
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the jS-oxidation process which commonly results in degradation of these assem- 
bly intermediates and prevents their intact incorporation. Examples of two 
tetraketide analogues (72) and (73) from the extensive range of modified pre- 
cursors tested are shown in Scheme 20 along with the expected novel tetronasin 
analogues (74) and (75). The generation of novel fluorinated analogues was not 
efficient enough to be detected by NMR. However, in an interesting applica- 
tion of high resolution electrospray mass spectrometry, [M-tK] peaks could be 
seen in electrospray mass spectra of partially purified extracts from the in- 
corporation experiments and the levels of incorporation estimated by com- 
parison with the relevant [M + K]"^ peak of the natural metabolite (Fig. 10). In 
this way, levels of incorporation as small as 0. 1 % could be readily detected. It was 
interesting, but perhaps not surprising, that incorporation was observed only 
with precursor analogues containing fluorine and/or where the methyl was 
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tetraketide intermediate 
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Fig. 10. High resolution electrospray mass spectrum of tetronasin (71) containing analogue 
(74) resulting from feeding of precursor (72). (Figure kindly supplied by J. Staunton) 



replaced by ethyl. The larger structural analogues, i. e. those with benzyl and 
iso-propyl groups, were not incorporated. This indicates that the PKS has some, 
but limited, relaxation of substrate specificity for assembly intermediates. 

The success, albeit limited, of incorporation studies of polyketide assembly 
intermediates has resulted from feeding these in the form of their NAC thio- 
esters which structurally mimic the thiol end of the phosphopantetheine moiety 
found in coenzyme A and the acyl carrier protein component of the PKS. This 
will be discussed further below, but it has also been shown that there are advan- 
tages to feeding starter units in the form of their NAC thioesters. 

There is considerable indirect and direct evidence that the PKS responsible 
for the assembly of norsolorinic acid (NSA, 79), the first isolable intermediate in 
the pathway to the aflatoxins, is primed by a hexanoate starter and, indeed, as 
discussed above, feeding [l-^^C]hexanoic acid to averufin-producing cultures of 
Aspergillus parasiticus was reported to give some intact incorporation. To study 
this further we developed a method for production of NSA ( 79) in shake cultures 
and fed [2-^H3]hexanoate (Scheme 21) to these cultures in the form of the free 
acid (76), the ethyl ester (77) and finally the NAC thioester (78) [67]. NMR 
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Fig. 11 a-d. 0-5 Ppm regions of: a the 400 MHz 'H NMR spectrum of norsolorinic acid (79) as 
its tetramethyl ether; b the 62 MHz ^H NMR spectrum enriched from feeding [2-^H2]hexanoate 
in the form of the free acid (76); c the 62 MHz ^H NMR spectrum enriched from feeding 
[2-^H2]hexanoate in the form of the ethyl ester (77); d the 62 MHz ^H NMR spectrum enriched 
from feeding [2-^H2]hexanoate in the form of the NAC thioester (78) 

analysis of the NS A isolated in each case is shown in Fig. 11. It can be seen that 
whereas the free acid gave a low direct incorporation of label into the 2 '-methy- 
lene against a high background of indirect incorporation via degradation to 
^H-labelled acetyl CoA (as measured by incorporation into the 6 '-methyl 
group), the NAC thioester gave a very high specific incorporation (ca. 45%) 
with only a trace of indirect incorporation. Similar results for the NAC thioester 
have been reported by Townsend who also observed good intact incorporation 
(4-5 %) of the NAC thioester of the “diketide” intermediate [ 1 - ^^C] -3-oxooctano- 
ate into averufin (62) [50]. 

Following the success of the hexanoate thioester feeding, we carried out 
similar experiments with ^H-labelled butyrate, pentanoate and heptanoate. Of 
these, the pentanoate results almost exactly mirrored those for hexanoate, the 
others showing only poor incorporation exclusively by prior degradation to 
acetate. The pentyl analogue (80) of norsolorinic acid was subsequently isolated 
by preparative hplc separation from the natural metabolite and fully characteri- 
sed. Thus it appears that the NSA PKS can accept both pentanoate and hexa- 
noate starters with comparable facility. 
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To probe the substrate specificity further, the incorporation of the NAC 
thioester of 6-fluorohexanoate was examined. The 470 MHz NMR spectrum 
of the metabolite (derivatised as its tetramethyl ether) showed a very strong 
triplet of triplets at <5 -218.2 ppm, similar to that observed in the thioester. The 
mass spectrum showed a large amount of a new metabolite of MW 444 
(Fig. 12b). In the 500 MHz ^H NMR spectrum (Fig. 12a), the 6'-methylene sig- 
nal of the 6'-fluoroanalogue (81) appeared as a doublet of triplets, whereas the 



OH 0 OH 0 




0 



80 R = H 

81 R = CHsF 



6'-methyl signal for the co-produced natural metabolite appeared as a triplet 
at 0.90 ppm. Integration of the signals indicated that the fluoroanalogue com- 
prised ca. 40% of the mixture which again was confirmed by hplc separation. 
In contrast, when the thioester of 2-fluorohexanoic acid was fed, there was no 
evidence whatsoever of fluorine incorporation and the yields of NSA were 
severely reduced, suggesting that the thioester may be acting as an inhibitor of 
the PKS. 




Fig. 12 a. 500 MHz 'H NMR spectra of the tetramethyl ethers of norsolorinic acid (79) and the 
6-fluoro-analogue (81) resulting from feeding 6-fluorohexanoic acid as its NAC thioester 





Fig. 12 b. El mass spectra of the tetramethyl ethers of norsolorinic acid (79) and the 6-fluoro- 
analogue (81) resulting from feeding 6-fluorohexanoic acid as its NAC thioester 



6 

Meroterpenoids 

A major interest has been in the study of a group of compounds of mixed poly- 
ketide-terpenoid origins - the so-called meroterpenoids. Our interest in these 
compounds began with the isolation of andibenin B (82) from A. variecolor. 
Other clearly related structures including andilesin A (83) [ 68 ] and anditomin 
(84) [69] were subsequently isolated and the structures of these C 25 metabolites 
suggested a sesterterpenoid origin. However, the results of incorporation expe- 
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riments [70] with ^^C-labelled acetates and methionine gave a labelling pattern 
indicative of a mixed polyketide-terpenoid biosynthesis (Scheme 22) in which a 
di-C-methylated tetraketide-derived phenolic precursor (85) is alkylated by 
farnesyl pyrophosphate to give the cyclohexadienone (86), which after epoxide- 
mediated cyclisation to the triene (87) undergoes an intramolecular Diels-Alder 
reaction to generate the bicyclo [2.2.2] octane system (88) observed in andibenin 
B and andilesin A. Subsequent oxidative modifications and elaboration of the 
terpenoid portion would generate the c-lactone and spiro-<5-lactone systems of 
(83) and (82) respectively, whereas a carbocation initiated ring expansion of the 
carbocyclic ring of the tetraketide-derived moiety in (88) would generate the 
rearranged ring system observed in anditomin (84). While this work was in pro- 
gress, the isolation was reported of two further metabolites for which sester- 
terpenoid origins were proposed. These were the mycotoxins austin ( 89) and 
terretonin (90) from Aspergillus ustus [71] and Aspergillus terreus [72] re- 
spectively. Similar incorporation experiments with ^^C-labelled acetates and 




(90) 
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methionine gave results that suggested that these were further metabolites of the 
meroterpenoid pathway via the common intermediate (86). Conclusive evidence 
for the meroterpenoid origins of these metabolites was obtained by the synthe- 
sis of ^H-labelled 3,5-dimethylorsellinic acid (85) (as its ethyl ester) [73] and its 
specific incorporation into andibeninB (82) [74],austin (89) [75] and terretonin 
(90) [76]. This was established by NMR analysis of the metabolites isolated 
from feeding experiments with (85) specifically labelled in the 3-methyl group. 
The mode of incorporation of the carbon skeleton of 3,5-dimethylorsellinate 
into these metabolites is summarised in Scheme 23. Whereas the carbocyclic 
ring is incorporated intact into the andibenins (e.g. 82) and andilesins (e.g. 83), 
and undergoes one-bond cleavage only on incorporation into anditomin (84), it 
is fragmented to an unprecedented degree on incorporation into austin ( 89) and 
terretonin (90). Further information on the mechanisms involved in these 
drastic modifications of the tetraketide and farnesyl derived modifications of 
the metabolites have been provided by extensive studies with and 

labelled precursors. 

The labelling pattern obtained from incorporations of [ 1- ^^€,^* 02 ] acetate and 
^*©2 were consistent with formation of the y-lactone in the andibenins (82) and 
andilesins (83) by oxidation of the 1 '-methyl group and subsequent lactonisa- 
tion on to the C-8' carboxyl [77, 78]. The results of incorporation of ^*©2 into 
austin (89) indicated a modification mechanism in which the orsellinate moiety 
undergoes a ring contraction via an a-ketol rearrangement followed by Baeyer- 
Villiger-type oxygen insertion to form the <5-lactone as shown in Scheme 24. 
Low incorporation levels precluded the observation of isotope shifts from 
[l-^^C,^®02]acetate into austin, but this was overcome by synthesising [79] 
3,5-dimethylorsellinate doubly labelled with and in both the carboxyl 






Scheme 23 
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carbonyl and at the C-2 position. This was incorporated with high efficiency into 
austin (Fig. 13), confirming the ^*©2 results [75]. The level of incorporation of 
label into the C-8' carbonyl is approximately half that into the C-6' hydroxyl, 
consistent with y-lactone formation via the free carboxyl, as shown in 
Scheme 24. 

Similar labelling experiments [76] were consistent with the formation of 
terretonin (90) via the same tetracylic carbocation (91) as proposed in the 
biosynthesis of austin (89). Ring contraction as indicated in Scheme 25 would 
generate the cyclopentadione (92). Subsequent aerobic hydroxylation, lactoni- 
sation and carbon-carbon bond cleavage via a retro-Claisen reaction gives the 
intermediate carboxylic acid (93). Methylation and oxidative modification of the 
terpenoid moiety would give the final structure (90). The subsequent isolation 
of the citreohybridones (e.g. 104) and andrastatins (see below) gives added 
weight to this hypothesis. 

and labelling has also provided information on the mechanism of 
formation of the spiro-lactone systems observed in the andibenins (e.g. 82) and 
austin (89) [80]. The mass spectrum of andibenin B (82) isolated from the ^*©2 
labelling experiment showed a major parent ion peak at m/z = 434, indicating the 
simultaneous incorporation of four atoms. The NMR spectrum showed 
that in addition to the y-lactone ether oxygen, the oxygens attached to C-3, C-4 
and C-10 are all derived from the atmosphere. These results show that the C-3 
lactone function must be formed by a biological Baeyer-Villiger type oxidation 
of a corresponding 3-keto precursor (94) (Scheme 26). 






36 



TJ. Simpson 




Fig. 13. '®0-lsotopically shifted resonances in the 100.6 MHz p.n.d. NMR spectrum of 
austin (89) enriched by feeding ['^C,'*0]-3,5-dimethylorsellinate to Aspergillus ustus 
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The presence of atmospheric oxygen at both C-4 and C-10 suggests possible 
mechanisms for the biosynthesis of the spiro-lactone system. As the andilesins 
(e.g. 83) possess a 7-membered lactone ring, it seems reasonable to propose that 
the required skeletal rearrangement to form the spiro-lactone occurs sub- 
sequent to the introduction of the C-4 oxygen atom. A relatively stable carboca- 
tion could be generated from the andilesin lactone ring (95) by alkyl-oxygen 
cleavage. Loss of the C-5 proton would then give the tetrasubstituted alkene (96). 
Subsequent oxidation of (96) to epoxide (97) followed by rearrangement would 
give the carbocation intermediate (98). Three paths for the conversion of (98) 
into the spirolactone (82) can be envisaged. Elimination to the endocyclic or 
exocyclic alkenes (99) and (100) and subsequent epoxidation and hydride 
reduction (paths a and b) would account for the origin of the 10-hydroxyl in 
atmospheric oxygen. Any subsequent mechanism for the formation of the <5-lac- 
tone will result in two atoms of atmospheric oxygen in the lactone. The structure 
of austin (89) which contains both the spiro-lactone and a 9,10-double bond 
provides some support for this general route. An alternative route (path c) in- 
volves trapping of the C-10 carbocation (98) by the carboxyl group to form 
lactone (101) which then undergoes transesterification, giving the A and B rings 
of andibenin B directly. In this case it is not necessary to invoke a further oxi- 
dation step. Prior hydrolysis of lactone (101) to form the spiro-lactone would 
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imply a loss of half the oxygen label at C-3 which is not consistent with the rela- 
tively similar levels of incorporation of atmospheric oxygen observed at C-3, 
C-4 and C-10. 

These possible mechanisms have been distinguished by a series of ex- 
periments in which [5-^H2]-, [6-^U.i] and [5-^^C, 4-^H2]mevalonates were in- 
corporated into andibenin B (82) [80]. Most importantly, these experiments 
showed the retention of three deuterium atoms at C-13 and one at C-9 from the 
appropriately labelled mevalonates. These results rule out intermediates (99) 
and (100), which have double bonds in these positions, and so provide strong 
evidence for path c in Scheme 26. Consistent with the previous results from 
incorporation of [1, 2- ^^C 2 ] acetate, there is no randomisation of labelling 
between C-14 and C-15, so the stereochemical integrity of the gem-dimethyl 
group is retained throughout the biosynthetic pathway which perhaps suggests 
that the alkene (96) is formed by a concerted elimination from the e-lactone 
system (95). 

Austin (89) is clearly biosynthetically related to andibenin B and is formed 
from the same key intermediate (86). An interesting possibility was that the 
tetracyclic intermediate (91) involved in austin biosynthesis (Scheme 24) is not 
formed directly from cyclisation of (86) but is formed via the same bicyclo- 
farnesyl intermediate (87) involved in andibenin biosynthesis (Scheme 22). To 
test this, [6-^^C, b-^Hjjmevalonate was fed to 5-day old cultures and the resultant 
enriched austin isolated and its ^H, noise decoupled nmr spectrum deter- 
mined. This showed clear isotopically shifted signals corresponding to the 
incorporation of two and mainly three deuteriums into the 12-, 13- and 
14-methyl groups. The result for the 12-methyl group excludes the possibility of 
the involvement of (87) in austin biosynthesis. 

The labelling studies described above provide definitive evidence for the 
mixed polyketide-terpenoid biogenesis of the andibenins, andilesins, andi- 
tomins, austin and terretonin. The formation of the bicyclo [2.2.2] octane system 
in the first two classes of metabolite provides a rare example of a biosynthetic 
Diels- Alder reaction. The biosynthetic relationship of austin and andibenin was 
supported by the isolation of austin from another mutant strain of A. variecolor 
[81]. Further metabolites related to austin have been isolated from Emericella 
dentata [82] and Penicillium diversum [81]. Other complex metabolites which 
are almost certainly further products of the meroterpenoid pathway are fumiga- 
tonin ( 102) and paraherquonin (103) which have been isolated from Aspergillus 
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Scheme 27 



fumigatus [83] and Penicillium paraherquei [84] respectively. More recently the 
citreohybridones, e.g. (104), and the closely related andrastatins have been 
isolated from Penicillium citreo-viride [85] and Penicillium sp. OF-3929 [86] 
respectively, providing yet further members of the family. Labelling studies [87] 
with ^^C-labelled acetates and [carboxy, 2-^^C2]-3,5-dimethylorsellinate into 
citreohybridone (104) are entirely consistent with these proposals. These latter 
metabolites are clearly related to terretonin (90) and the likely biogenetic rela- 
tionships amongst these structurally varied metabolites are summarised in 
Scheme 27. 

7 

3-Nitropropanoic Acid: Labelling Studies 

3-Nitropropanoic acid (109) is a toxic metabolite produced in higher plants and 
by several fungi. It has been shown to be produced in Penicillium atrovenetum 
from L-aspartate (105) (Scheme 28). A series of papers has been published on the 
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details of this conversion which provide some excellent applications of the use 
of labelling in conjunction with other stable isotopes. Since has a natural 
abundance of 0.36% and a spin of 1/2, NMR is possible. Although most bio- 
synthetic studies have detected incorporation of via its coupling to the 
direct observation of incorporation by NMR spectroscopy is possible. 

The nitrogen atom in (109) was shown to be derived from aspartate by incor- 
poration of dl-[ 2-^^C,^^N] aspartic acid [88]. In the NMR spectrum of the 
enriched metabolite, the signal corresponding to the C-3 methylene carbon was 
observed (Fig. 14) as a broad singlet due to with ^"^N attached and a sharp 
doublet (/cN 8.7 Hz). The presence of the doublet, offset to lower frequency by 
the associated ^®N isotope shift, confirmed the intact incorporation of the C-N 
bond from aspartate. The origin of the oxygen atoms was elegantly demon- 
strated by fermentation under an atmosphere of ^® 02:^®02 (1:1) in a defined 



‘/cN 8.7Hz 

A 




69.5 69.0 



Fig.14. The C-3 methylene region of the 50 MHz p.n.d. NMR spectrum of 3-nitropropanoic 
acid (109) enriched from feeding DL-[2-'^C,'^N]aspartic acid to Penicillium atrovenetum. 
(Figure kindly supplied by R.L. Baxter) 
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Fig. 1 5 . 36.5 MHz p.n.d. NMR spectrum of 3-nitropropanoic acid biosynthetically enriched 

by growing Penicillium atrovenetum on '^NH4C1 under an '*02 atmosphere. The observed '*0 
isotope induced shift is 0.08 ppm per oxygen atom. (Figure kindly supplied by R.L. Baxter) 



medium in which ^®NH 4 Cl was the sole nitrogen source [89]. The NMR 
spectrum of the isolated 3-nitropropanoic acid showed 2 and 3-bond 
couplings which on integration confirmed a >95% abundance. The 
DEPT NMR spectrum (Fig. 15) showed three signals corresponding to '®N'®02, 
i 5N'®0^®0 and ^®N ^®02 species, proving that both oxygen atoms of the nitro 
group are derived from molecular oxygen. ['®N]-Nitrosuccinate (106) was 
shown to be an intermediate by ^®N NMR analysis of the metabolite isolated 
after feeding the substrate as its diethyl ester [90]. Assignment of the stereo- 
chemistry of the intermediate nitrosuccinate was ascertained indirectly by 
examining the incorporation of deuterium from l-[2, 3, 3-^H3] aspartate into 
(109). The resulting NMR spectrum shows that is retained at both the 

2- and 3-positions, indicating that the stereochemistry of the L-aspartate is 
retained in the oxidation of the amino acid to nitrosuccinate and so makes 
inversion unlikely. As a result of these and further experiments with doubly 

labelled aspartates and 3-nitropropanoic acid which revealed a futile cycle 
involving 3-nitroacrylate (108), the pathway summarised in Scheme 28 was 
proposed [91]. Oxidation of L-aspartate affords (S)-nitrosuccinate which on 
decarboxylation produces and ad-nitro-intermediate (107) as the kinetic pro- 
duct which tautomerises to 3-nitropropanoic acid. When nitropropanoate 
dehydrogenase is present, (107) or (109) may be intercepted and oxidised to 

3- nitroacrylate. It was suggested that this cycle may serve as an additional 
mechanism for regulating NADP/NADPH levels in the cell. 

8 

Isotopic Labelling of Biosynthetic Proteins 

Stable isotope labelling is also proving to have an important role in studies on 
the enzymology of polyketide biosynthesis. The acyl carrier protein (AGP) com- 
ponents of polyketide synthases (PKSs) are believed to play a central role in the 
control of the assembly and stabilisation of polyketide intermediates, especially 
of the highly oxygenated intermediates necessarily involved in biosynthesis of 
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the polycyclic aromatic antibiotics, e.g. actinorhodin ( 110 ) and oxytetracycline 

(111) , produced by Type II PKSs in Streptomycetes [7]. A number of these ACPs 
have been isolated after cloning of the gene and overexpression in E. coli. [92]. 
The full 3D structure of actinorhodin AGP (Fig. 16) has been elucidated by 
application of 2D NMR methods [93, 94]. The AGP contains 14 leucine residues 
out of a total of 82 amino acid residues and a crucial part of the refinement of 
the structure was the biosynthetic labelling of the protein by expression in E. coli 
grown on a minimal medium supplemented with leucine labelled stereospecifi- 
cally in the diastereotopic methyl groups. 

The specifically labelled leucine U14) was prepared [95] as shown in 
Scheme 29 by a chemoenzymatic synthesis in which the stereogenic centre at 
G-4 was established by treatment of the enolate of the propanoyl oxazolidinone 

(112) with GD 3 I. Gleavage of the oxazolidinone followed by two carbon homo- 
logations via a Grignard reaction gave the a-ketoester (113) selectively labelled 
with deuterium. Saponification of the ester and reductive amination of the 
ketone catalysed by leucine dehydrogenase gave (2S,4R)-[5, 5, S-^Hjlleucine. 
Related chemoenzymatic routes can be used to prepare readily a wide range of 
isotopically labelled amino acids for protein structural studies and metabolic 
studies [96]. 




Fig. 16. Molscript diagram showing the restrained minimised structure of actinorhodin apo 
AGP 
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iv) NaOFI then leucine dehydrogenase. 



Scheme 29 



Part of the COSY NMR spectra of the methyl region of the unlabelled 
and labelled AGP are shown in Fig. 17. This shows the absence of one set of 
leucine methyls in the labelled protein. The effect is most clearly seen by looking 
at the signals assigned to leucine 45. These appear at 0.28 and 0.38 ppm in the 
unlabelled spectrum. In the labelled spectrum the signal at 0.38 ppm has been 
completely eliminated, allowing the assignment of the remaining signal to the 
pro-S methyl of leucine 45. 

For in vitro studies of polyketide assembly using the purified components of 
the Type II PKS, it is essential to have the ACPs in their active ho/o-form which 
can then be acylated chemically or enzymatically (Scheme 30). The inactive 
apo-ACPs are converted to the active holo-form by addition of phosphopantet- 
heine from coenzyme A to a conserved serine residue. This addition is mediated 
in each organism by an ACP-holo synthase. In E. coli there is an ACP-holo 
synthase which is part of its endogenous fatty acid biosynthetic machinery but 
the substrate specificity of this enzyme for heterologous ACPs has been shown 





Fig. 17 a, b. 500 MHz DQF-COSY NMR spectra of actinorhodin ACP; a unlabelled; b labelled 
from incorporation of (2S,4R)-[5, 5, 5-^H3]leucine 
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[92, 97] to vary and is generally low for type II PKS ACPs. An interesting ob- 
servation was made during the preparation of ^®N-labelled oxytetracycline ACP 
which was required for 3D NMR studies [98] . Thus E. coli containing the otc ACP 
gene on a heat-inducible expression system was grown at 30 °C to mid log phase. 
The cells were then washed and transferred to minimal medium containing 
^®NH4C1 and immediately induced by heat shock to 42° for 30 min. Subsequent 
growth at 30 °C for 18 h followed by purification of otc ACP revealed that two 
ACPs eluted from the Q-Sepharose column (Fig. 18). These were analysed by 
electrospray mass spectrometry. As expected, the first peak contained the 
apo- ACP in -labelled form with a mass of 10026 Da (10031 Da predicted for 

all 115 nitrogen positions labelled). This fraction contained a trace amount 
of unlabelled apo-ACP (9914 Da measured, 9916 calculated). The second frac- 
tion eluted from the HPLC column contained labelled ho/o-ACP with a mass of 
10367 Da (10 376 predicted for all N positions labelled). Significantly, this spe- 
cies contained approximately 10% of a species of holo-ACP containing no 
label with a mass of 10256 Da (10256 calculated). The unlabelled proteins must 
have been produced prior to addition of label and concomitant induction. Thus 
it appears that the endogenous ACP ho/o-synthase is able to convert low con- 
centrations of apo-ACP to ho/o-ACP with high efficiency, but not the high con- 
centrations produced after induction. This may provide evidence for substrate 
inhibition of the enzyme. 
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Fig. 18 a-c. Purification and ESMS analysis of oxytetracycline acyl carrier proteins after 
growth and induction in minimal medium containing '^NH4C1: a Q-Sepharose FPLC trace of 
E. coli protein fractions containing ACP; b ESMS of '^N-oxytetracycline apo-ACP: note distin- 
ctive envelope of Na"^ adducts; c ESMS of '^N-oxytetracycline ho/o-ACP, again showing Na"^ 
adducts, as well as a significant proportion (ca. 10%) of unlabelled hoZo-ACP 



9 

Conclusions 

There can be little doubt that the application of stable isotope labelling metho- 
dologies has made a major impact on the development of biosynthetic and 
metabolic studies in a wide range of systems. As biosynthetic work becomes 
increasingly focused on studies at the enzymatic level, there will be continued 
scope for the further development of these applications particularly making use 
of sensitive mass spectral detection methods rather than the NMR methods 
which have dominated whole cell studies. We look forward to the results of 
these studies with anticipation. 
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Aliphatic polyketides are a large family of natural products which exhibit an impressive range 
of biological activities. They are synthesised by a common pathway in which units derived 
from acetate, propionate and butyrate are condensed onto a growing chain, which is initiated 
from a range of structurally varied carboxylic acid starter units, in a process closely resem- 
bling fatty acid biosynthesis. The intermediates remain bound to the polyketide synthase 
(PKS) throughout multiple rounds of chain extension, and to a variable degree, reduction of 
the newly formed p-keto functionality. It is the manner in which each different PKS controls 
the number and type of extender units added, and the extent and stereochemistry of reduc- 
tion at each cycle which gives rise to the diverse structural variation seen in this family of 
natural products. Furthermore, the aglycone product of the PKS maybe modified by post-PKS 
enzymes, including glycosidases, methyltransferases, acylases and oxidative enzymes, to pro- 
duce still greater diversity. In this article the development of the field will be exemplified by a 
detailed discussion of the archetypal example of erythromycin A biosynthesis. This will then 
be followed by further discussion of other relevant areas of research in this field. 
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1 

Introduction to Polyketide Structures 

The common feature of polyketide natural products is that they are biosyn- 
thesised by repeated addition of small building blocks, usually comprising two 
or three carbons, to form a linear chain. The resulting chain can be a polyketone 
1 or it can have an array of functional groups formally derived by reductive 
modification of most or all of the keto groups of a polyketone. Typical modified 
residues are shown in 2-4. The length of chain may vary greatly, although the 
majority of poiyketides incorporate from four to ten building blocks. It is these 
heavily reduced aliphatic polyketide chains and molecules derived from them 
which form the subject of this article. Only such poiyketides of terrestrial, and 
not marine origin will be considered. 




The Archetypal Poly-^-keto Chain Structural Variations Produced by Reduction of Keto Groups 



The initial polyketide chain may survive unaltered, but almost always it is 
further elaborated by additional biosynthetic operations to give a structurally 
more complex product. Examples of these elaboration processes are oxidative 
changes, such as hydroxylation, epoxidation or oxidative cleavage, or addition of 
extra structural residues by esterification, alkylation or glycosylation. If the 
elaboration steps involve a cleavage or rearrangement of the primary polyketide 
chain, a structure may result which is radically different from the archetypal 
polyketide skeletal type represented by 5. Even so, the natural product is cate- 
gorised as a polyketide by virtue of its biosynthetic origin. 

The combination of variety in the processes of chain assembly and elaboration 
makes for great diversity of structure; indeed, by taking into account all the pos- 
sible biosynthetic variations it can be calculated that potentially there are tens, 
possibly hundreds, of millions of different structures waiting to be discovered. In 
fact, after decades of searching through metabolites of both marine and terrestrial 
organisms, the number of known compounds runs only into tens of thousands. 
New structures are being discovered at what seems an impressive rate until the 
potential for diversity is taken into account. The present limit of known diversity 
is further evident when rules of structural relationship such as “Celmer’s” rules are 
considered [ 1 ] . The known compounds are clustered into families of related struc- 
tures which may have a common evolutionary origin. The diversity map which 
then emerges is one of a few tiny islands of structure distributed over a blank 
ocean in which potential structures are absent. One of the challenges in this field 
is to understand the genetics of the biosynthetic processes with a view to expanding 
the range of structural diversity by genetic engineering. This goal, which seemed a 
fanciful idea as recently as 1990, had already been achieved five years later. 
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Many natural polyketides have been investigated with differing degrees of 
success and progress. Not surprisingly, most interest has focused on commer- 
cially important compounds which are useful in medicine or agriculture, and 
most progress has been made on one such compound, the antibiotic erythro- 
mycin. Accordingly, in this account erythromycin will be covered first and in 
great detail to show how far the field has developed. Other important biosyn- 
thetic pathways which are beginning to be understood at the genetic level will 
then be presented for comparison and to show the exciting potential for the 
discovery of new compounds offered by this field. 

2 

The Erythromycins 



2.1 

Structures of the Erythromycins 

The parent compound in the erythromycin family, erythromycin A 5, is typical 
of a large family of macrolides which are characterised by rings containing 12, 
14 or 16 atoms [2]. Members of the family show an intriguing common struc- 
tural and stereochemical relationship which was first noted in Celmer’s Rules 
[1]. Erythromycin A was isolated in 1952 from Saccharopolyspora erythraea [3]. 
It is used in clinical medicine against infections caused by Gram-positive 
bacteria, and it is the main treatment for many pulmonary infections such as 
Legionnaire’s disease. The structure was elucidated by classical methods [4] and 
confirmed by X-ray crystallography [5]. Other members of the family occur as 
biosynthetic intermediates and so will be presented in that context later. 



2.2 

Overview of the Erythromycin Biosynthetic Pathway 

The biosynthesis of erythromycin can be divided into two phases (Scheme 1). In 
the first constructive phase of the pathway a set of key enzymes, collectively 
known as the polyketide synthase (PKS), assembles the typical polyketide chain 
by sequential condensation of one unit of propionyl-CoA and six units of 
methylmalonyl-CoA 6. The initially formed chain is cyclised to give the first 
macrocyclic lactone (macrolide) intermediate 6-deoxyerythronolide B 7 [6, 7]. 
In the second phase 6-deoxyerythronolide B is elaborated by a series of “tailo- 
ring” enzymes which carry out regiospecific hydroxylations, glycosylations and 
a methylation (of an added sugar residue) to give finally erythromycin A. The 
core polyketide structure is generated by the PKS in phase one, but the later 
steps of phase two are essential to produce active antibiotics. 



2.3 

The Elaboration Steps of the Pathway 

In a search for new antibiotics or to improve the yields of the existing com- 
pounds, the producing organism was subjected to intensive mutation studies in 
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Scheme 1 



the hope of modifying the biosynthetic machinery. Most of the information con- 
cerning the late stages of the biosynthetic pathway came adventitiously as a 
spin-off from these studies through the chance generation of blocked mutants. 
When a late step on the pathway is blocked by lack of the requisite enzyme the 
previous intermediate may accumulate in sufficient quantities for it to be iso- 
lated and identified. Mutants blocked in each step in the pathway from 6-deoxy- 
erythronolide B 7 to erythromycin A 5 have been produced, allowing the late 
intermediates to be identified [8] (Scheme 2). 

The first step is hydroxylation at C-6 to give erythronolide B 8 [9] . Its inter- 
mediacy was confirmed by incorporation of radioactive 7 [10] and isotope 
dilution experiments [11]. Subsequently, L-mycarose is attached to the C-3 
hydroxyl group of 8 by a TDP-mycarose glycosyltransferase [12, 13], and the 
amino sugar D-desosamine is then added to the C-5 hydroxyl of 9 by the action 
of the enzyme TDP-desosamine glycosyltransferase [13, 14]. The resulting inter- 
mediate, erythromycin D 10, is the first to show antibacterial activity and occurs 
at a branch in the biosynthetic pathway. Either C-12 hydroxylation takes place 
with retention of configuration [15], to produce erythromycin C 11, or 0-methyl- 
ation of the C-3" hydroxyl of the mycarose moiety with SAM, catalysed by an 
0-methyltransferase, produces erythromycin B 12.Finallyerythromycin A 5 can 
be generated either by 0-methylation of 1 1 catalysed by an 0-methyltransferase 
and SAM, or by C-12 hydroxylation of 12. A single 0-methyltransferase operates 
on both branches of the pathway [16]. Similarly a single hydroxylase has been 
implicated [17]. Since this has a 1200- 1900-fold preference for erythromycin 
C over B it is considered that the pathway via erythromycin B is the minor one. 

The biosynthetic pathways leading to the sugar residues have not been 
established in detail, but a speculative pathway has been proposed based on 
genetic analysis [18, 19]. 
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i, C-6 erythronolide hydroxylase; ii, TDP-mycarose 
glycosyltransferase; iii, TDP-desosamine glycosyltransferase; 
iv, C-12 hydroxylase: v, O-methyltransferase 



Scheme 2 



2.4 

Biosynthesis of the Macrolide Core - Classical Precursor Studies 

Corcoran and co-workers first fed labelled precursors to S. erythraea and detec- 
ted their incorporation into 6-deoxyerythronolide B 7 (Scheme 3) [20]. These 
results gave the first evidence that the macrolide core is derived from one pro- 
pionyl-CoA and six methylmalonyl CoA units. Subsequently Cane and co- 
workers [21] fed [^®0]propionate and demonstrated that all the oxygens 
attached to carboxyl-derived carbons in the macrolide core were retained from 
the propionate precursor and were not derived from molecular oxygen or water. 
These results are consistent with reduction of each keto group of the putative 





The Biosynthesis of Aliphatic Poiyketides 



55 



Scheme 3 




polyketone chain to the required oxidation state and stereochemistry, although 
the timing of the reduction steps in relation to the chain extension processes 
remained to be established. 

In studies using blocked mutants, no intermediates preceding 6-deo- 
xyerythronolide B were isolated, implying that these intermediates are either 
too unstable to accumulate or enzyme-bound [8]. Subsequently in an important 
pioneering experiment. Cane and Yang investigated a possible diketide inter- 
mediate in which the keto group has been reduced to hydroxyl [22] . The free acid 
showed random incorporation into erythromycin B 12 indicating that the 
diketide had been degraded to propionate before being used in the biosynthesis. 
However, the iY-acetylcysteamine (NAC) thioester analogue 13 was incorpo- 
rated intact with retention of the coupling between the two simultaneously 
labelled sites of enrichment (Scheme 4). To demonstrate that 13 was not oxi- 
dised to the jS-keto thioester before incorporation an analogous deuterium 
labelled diketide 14 was fed and was again incorporated intact, thus eliminating 
this possibility [23]. The iY-acetylcysteamine (NAC) thioester was employed 
because it exhibits a high degree of structural homology with the thiol terminus 
of coenzyme A and of the 4'-phosphopantetheine group of the active acyl carrier 
protein. This experiment, with a similar contemporary pioneering contribution 
from Hutchinson et al. [24], opened up a highly productive phase of research 
into the biosynthesis of other polyketide metabolites (see later). From these 
results it can be concluded that the diketide ketoester intermediate produced by 
the first chain extension is reductively processed prior to the addition of the 




Scheme 4 



Erythromycin B 12 
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second chain extension unit. There is also compelling indirect evidence from the 
genetics of the polyketide synthase (PKS) that each successive new keto group is 
fully processed to its final reduced state prior to the subsequent chain extension 
reaction. This “processive mechanism” seems to apply widely for the chain 
assembly steps of aliphatic polyketide biosynthesis as clearly shown by many 
similar feeding experiments for other polyketide systems, e.g. the incorporation 
of 13 and 14 into methymycin in Streptomyces venezuelae which demonstrated the 
similarity of the initial stages of methymycin and erythromycin biosynthesis [25] . 



2.5 

The Reactions of Fatty Acid Biosynthesis 

All early attempts to isolate active PKS proteins from organisms producing 
aliphatic polyketides proved unsuccessful. Speculations concerning their mode 
of action were therefore guided by the strong parallel with the fatty acid bio- 
synthetic pathway which has long been investigated at the enzyme level [26,27]. 
Formation of a fatty acid is initiated by condensation of a starter unit (com- 
monly acetate) with an extender unit (malonate) (Scheme 5). The resulting 



SH 

O 

R'^SCoA (Start that 
c^lewith 
starter acyl) 





(Release 
n cycles) 




Scheme 5 



FATTY ACID CHAIN EXTENSION CYCLE 
KS s Keloacyl Synthase: ACP • Acyl Carrier Protein; KR > Ketoreductim; DH ^ 
Dehydr^ase; ER s Enoybeduct^; AT ^ Acyttransfetase; TE z Thioaeterase 
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jS-ketoester is then fully processed (reduced, dehydrated and reduced again) to 
give an elongated saturated fatty chain. A second chain extension cycle then 
takes place with the condensation of a new extender unit, followed by complete 
reductive processing of the new keto group. To carry out this repetitive sequence 
of operations the fatty acid synthase (FAS) requires a minimum set of catalytic 
activities, with each activity being responsible for one step of the cycle of reac- 
tions, i. e. acyltransferase (AT), jS-ketoacyl synthase (KS) and acyl carrier protein 
(AGP) for chain elongation; jS-ketoreductase (KR), dehydratase (DH), and enoyl 
reductase (ER) for processing of the j3-keto group; and thioesterase (TE) for 
hydrolytic release of the full length chain. Remarkably, a typical FAS requires 
only a single set of these enzymes to carry out all the successive chain extension 
cycles. Control over chain length is achieved by finely tuned substrate specifici- 
ty of key enzymes, notably the KS and TE components. 

The constituent proteins of a E AS can be organised in markedly different ways 
depending on the organism. In bacteria, the individual activities (domains) are 
freely dissociable and can be isolated separately [28]. In animals and yeast the 
domains are bound together by peptide links to form large highly organised 
non-dissociable multi-domain proteins [26, 27]. The multidomain proteins are 
designated “Type I” and the dissociable synthases are designated “Type II”. The 
same nomenclature has been adopted in the PKS field. 



2.6 

Sequencing of the Genes Coding for the Erythromycin PKS 

The first clue to the structure of the erythromycin PKS came with the sequenc- 
ing of the corresponding genes. These were located in the vicinity of the gene 
coding for erythromycin resistance ermE. This method was pioneered by 
Hopwood and Sherman who used it to identify the genes responsible for the 
actinorhodin PKS [29]. 

On both sides of ermE there are regions of DNA with open reading frames 
thought, on the basis of sequence homology with known enzymes, to encode for 
non-PKS proteins responsible for the late stages of the biosynthesis, from 
6-deoxyerythronolide B 7 to erythromycin A 5. Many of these assignments have 
been confirmed by targeted disruption of these genes to give mutants from 
which late intermediates have been isolated (see above). A map showing these 
regions of the genome is presented in Fig. 1. 

Sequencing further away from the resistance gene, the Leadlay group in Cam- 
bridge [30] and Katz and co-workers [31] at the Abbott Laboratories jointly dis- 
covered three exceptionally large open reading frames, each containing about 
10 kbp of DNA designated eryAI, All and AIIL Each of these genes codes for a 
giant (>3000 amino acids) multifunctional protein. It was proposed therefore 
that the erythromycin PKS consists of three giant proteins called DEBS 1, DEBS 
2 and DEBS 3 respectively [30 - 34]. In the primary sequence of the putative pro- 
teins it was possible to identify regions with motifs showing strong sequence 
homology with the active site motifs of proteins involved in fatty acid synthases. 
It was this homology that convinced the investigators that they had discovered 
the genes coding for the elusive PKS proteins. 
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Fig. 1. Map of the S. erythraea genome containing genes associated with the late stages of the 
erythromycin biosynthetic pathway 
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Fig. 2. Predicted domain organisation of the DEBS Proteins. Ketoacyl Synthase (KS); Acyl 
Transferase (AT); Dehydratase (DH); Enoyl Reductase (ER); Keto Reductase (KR); Acyl 
Carrier Protein (ACP); Thioesterase (TE). Each domain is represented by a box with coded 
shading whose length is proportional to the size of the domain; (KR) indicates an inactive KR 
domain. The ruler indicates the residue number within the primary structure of the con- 
stituent proteins. Linker regions are shown in proportion 



In the linear representations of the primary sequence of the proteins shown 
in Fig. 2, sections thought to be associated with specific catalytic activities are 
indicated by coded blocks. These catalytic centres are termed domains. Each 
domain is thought to be folded to form a localised globular structure with a cata- 
lytic role dictated by its specific active site. The various regions between the 
domains which are unmarked in this diagram are thought to play a passive but 
vital structural role in maintaining the various domains in the correct topology 
for co-operation in the overall catalytic process. Most of these structural regions 
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consist of short sections of amino acids rich in alanine, proline and charged resi- 
dues, up to 30 residues long, which are thought to serve as linkers between the 
globular domains. There are also intriguing much longer sections of primary 
sequence in all three proteins preceding the KR domains which cannot be asso- 
ciated with any specific catalytic activity. 



2.7 

Organisation and Function of the Erythromycin PKS 

To help explain the synthetic implications of this organisation of the domains a 
different representation of the PKS is shown in Scheme 6, in which domains are 
represented by circles and the linker regions and structural residues are ignored. 
In all there are six ketosynthase (KS) domains, one for each chain extension 
cycle. Following each KS there are two domains, AT and AGP, essential for the 
condensation step, plus an appropriate set of optional domains KR, DH, ER, for 
modification of the new keto group (except module 3). To aid the analysis, the 
optional domains are raised above the line of the essential domains in this 
cartoon version of the structure. It can be seen that the activities are organised 
into six “modules” (two per protein), each of which is able to carry out a con- 
densation step using the three essential domains followed by modification of the 
keto group in the newly formed keto ester to some extent (except module 3). 
There is a striking correspondence between the set of domains in successive 
modules and the extent of keto group modification in the newly added C3 unit 
of successive intermediates. At the start of the first protein, DEBS 1, there is a 
“loading” module for the starter acid (propionate) which consists of an AT 
linked to an AGP. Module 6 is followed by a thioesterase domain which is thought 
to catalyse lactonisation of the polyketide chain and so release the completed 
macrolide structure from the enzyme. 

According to this modular analysis, each protein catalyses two cycles of chain 
extension. The term “cassette” has been proposed for the giant proteins [34]. All 
three cassettes in the erythromycin cluster are bimodular, but in other systems, 
such as the rapamycin [35] and tylosin [36] PKSs, the size of a cassette can vary 
from one to six chain extension modules. The three cassettes co-operate in some 
way to form an extraordinarily complex molecular assembly line. The bio- 
synthetic intermediates remain PKS-bound throughout the whole synthetic 
sequence via thioester links. A challenging feature of this organisation is the 
mechanism which controls the ordering of the cassettes in the assembly line so 
that the transfer of the growing chain from one cassette to the next is correctly 
controlled. 

The correspondence between the domain composition of successive modules 
and the structure of each newly added G3 unit in successive intermediates is 
persuasive but it does not prove that the modules function in the order indica- 
ted. The protein could fold in many different ways to bring sets of non-con- 
tiguous domains together to form functional modules. It was a high priority to 
establish that modules are indeed made up of adjacent sets of domains as 
indicated, and also that the individual modules and domains are used in the 
order shown. 
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The first strong supporting evidence came from Katz and co-workers who 
carried out pioneering gene disruption experiments. In 1991 they disrupted the 
p-ketoacyl reductase (KR) in module 5 and from the resulting mutant they 
isolated a partly processed erythromycin analogue 15, in which a keto group has 
survived at the predicted position of the macrolide in place of the normal 
hydroxyl group [37]. The presumed product of the PKS is therefore the 5-keto 
analogue 16 of 6-deoxyerythronolide B (Scheme 7). More recently the enoyl 
reductase (ER) in module 4 was disrupted and an analogue 17 of erythromycin 
was isolated containing a double bond at C-6/C-7 (Scheme 8) [38]. It can be con- 
cluded that the disrupted keto reductase operates in the fifth cycle of 
chain extension and that the disrupted enoyl reductase operates in the fourth 
cycle as predicted by the simple successional arrangement of modules in 
Scheme 6. 

These two experiments also had important implications for genetic engineer- 
ing of polyketide biosynthesis to produce novel metabolites. The formation of 
the two structural analogues of the polyketide chain and the fact that they are 
correctly lactonised shows that the structure of the nascent chain may not play 
a critical role in polyketide biosynthesis, and at least some altered poiyketides 
can be substrates for further cycles of chain extension. This implies that the 
correct transfer of the growing chain from one synthase unit to the next may 
reside more in the specific juxtaposition of the various domains rather than in 
the conventional substrate specificity of a particular synthase domain for the 
structure of the approaching chain. It is interesting and encouraging that the 
product released from the PKS in both experiments was at least partly 
processed towards an analogue of erythromycin A, showing that the elabora- 
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tion enzymes also possess a useful degree of relaxation in their substrate 
specificity. 



2.8 

Isolation and Structural Studies of the Erythromycin PKS 

All three DEBS proteins have been overexpressed and successfully purified to 
homogeneity [39]. Gel filtration indicated that all were dimeric under native 
conditions [39], and a DEBS multienzyme has since been confirmed to be ex- 
clusively homodimeric under the conditions of analytical ultracentrifugation 
[40]. The dimeric character of the PKS proteins mirrors the behaviour of the 
animal EAS and further reinforces the parallels in their properties. 

The first information concerning the structure of the DEBS homodimers 
came from limited proteolysis studies [40-42]. For each DEBS protein the 
cleavage pattern was highly specific and reproducible under given conditions. 
Sets of fragments were obtained containing one or more intact domains pro- 
duced by cleavages in the linker regions predicted on the basis of sequence 
alignments. The results are summarised for DEBS 1 in Fig. 3. 

Significantly, the most rapid cleavages under conditions of limited proteolysis 
were always found to occur between the two chain extension modules housed in 
each multienzyme [40-42]. Other initial cleavages split the N-terminal loading 
didomain from DEBS 1 and a didomain consisting of the C-terminal offloading 
thioesterase from DEBS 3 together with its attached AGP domain [39, 40]. With 
the exception of the loading didomain, all of the fragments corresponding to 
intact modules behaved as homodimers on gel filtration. 
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Fig. 3 . Pattern of fragments generated by controlled proteolysis of DEBSl 

More extensive proteolysis of DEBS 1, DEBS 2 and DEBS 3 gave smaller 
fragments [40-42]. Most significantly, the six KS domains were released as 
didomains with their attached AT domains and all of these were homodimeric. 
In contrast, fragments consisting of single domains associated with keto group 
modification were all monomeric: the KR domains of modules 1, 3, 5 and 6, and 
the single ER domain from module 4. 

Proteolysis of DEBS dimers in which the thiols at the active sites of co-opera- 
ting AGP and KS residues had been crosslinked by prior treatment with 
1,3-dibromopropanone gave a different pattern of fragments consistent with the 
formation of crosslinks between two complementary protein chains [40] . It was 
therefore concluded that the KS of one chain co-operated with the AGP residing 
in the complementary chain rather than in its own, as had been demonstrated 
earlier for the animal PAS. 

The Gambridge group which carried out these experiments rationalised the 
results in terms of a dimeric “double helical” structure for the DEBS proteins, 
illustrated for DEBS 3 in Pig. 4 [40]. To arrive at the double helix the two poly- 
peptide chains of the DEBS are first folded to give domains and then lined up 
side by side in a parallel “head to head”, “tail to tail” fashion as shown. The two 
chains are then twisted together to form a double helix with the KS, AGP and AT 
domains of each module forming the core of the structure. The helix is shown in 
ribbon form to aid clarity in the positioning of the domains. The sense of the 
helical twist is arbitrary at this stage. The twist brings the AGP of one chain 
below the KS domain of the identical module in the complementary chain 
allowing the observed inter-chain cross-linking. 

The optional reductive domains in each module needed for ketone group 
modification (DH, ER and KR) form loops that protrude out sideways from the 
central core, while maintaining the active sites of all reductive domains within 
range of the phosphopantetheine group on the adjacent AGP in the same chain. 
There is therefore no contact between reductive domains of complementary 
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Fig.4. Proposed folding of DEBS 3 to give the “double helical” model 



chains which is consistent with the monomeric nature of these domains in the 
isolated state (see Fig. 3). In contrast the KS domains (and TE domain of module 
6) are in intimate contact in the core of the helical structure and so it is rea- 
sonable that the proteolytic fragments containing them wordd retain the homo- 
dimeric character of the intact protein. Three such double helices stacked on top 
of the other give rise to the complete erythromycin PKS. 

Further evidence concerning the arrangement of domains of the DEBS 
homodimers came from a complementation study in which two genetically 
engineered versions of the protein were made, one defective in the KS domain of 
module l,the other defective in the KS domain of module 2 [43]. Neither protein 
was active but activity could be restored by preparing a heterodimer containing 
one chain of each type. The work also established that a given AGP co-operates 
with KS domains in complementary chains. An alternative model for the 
erythromycin PKS has been proposed in which the homodimer is formed with 
the complementary chains running antiparallel side by side. The resulting flat 
modules are then stacked one above the next as in a stack of pancakes with 
linker regions running vertically between linked modules at the outside of the 
stack (Pig. 5). 

The architecture of the homodimers is crucial to understanding how these 
fascinating synthases operate. This subject is bound to be a high priority for 
future work both in the PKS and the animal FAS fields. 



2.9 

Genetic Engineering of the Erythromycin PKS 

The pioneering studies by the Katz group have already been discussed in sup- 
port of the proposed modular structure of the PKS. A common characteristic of 
these genetic engineering experiments was the essentially destructive character 
of the mutations. The activity of individual domains could be destroyed without 
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Fig. 5. Proposed model for DEBS based on the proposed linear “head to tail” model for the 
animal FAS 



destroying the overall capacity of the PKS to make macrolide structures. This 
strategy could therefore be used to generate novel products, but clearly the range 
of possible variations was very limited. 

The erythromycin PKS has more recently been genetically engineered in 
many different ways. Most work to date has relied upon an expression plasmid 
pRM5 developed by Hopwood and co-workers for expression of genes in 
Streptomyces coelicolor, an organism which normally produces an aromatic 
polyketide, actinorhodin [44]. In initial experiments it was demonstrated that 
the whole of the erythromycin PKS could be transferred to S. coelicolor to give a 
mutant with the capacity to produce the unelaborated macrolide product, 
6-deoxyerythronolide B 7 as well as the analogue with an acetate starter unit 
[45] . This mutant has also proven to be a more efficient vehicle for incorporation 
experiments with added precursors than the natural host, S. erythraea [46]. The 
development of this highly efficient vehicle for genetic engineering led to rapid 
progress in the production of modified polyketide products. 

The Leadlay Staunton group made the first significant breakthrough by 
repositioning a domain within the PKS to show that the repositioned domain 
could carry out its normal type of reaction in its new context on a foreign 
substrate. This entailed adding a copy of the thioesterase (TE) domain to the 
terminus of DEBS 1 [47] . The aim was first to prevent further chain extension by 
blocking the docking of the C-terminus of DEBS 1 onto the N-terminal end of 
DEBS 2 (this was the predicted consequence of this change according to the 
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Add copy of IE to DEBS 1 




Scheme 9 



(18) 



‘helical’ model for the PKS structure). It was further reasoned that the TE 
domain would play an active role in the engineered protein by catalysing the 
release of the triketide intermediate from module 2 as a d-lactone. 

A mutant of S. erythraea containing the engineered protein (called DEBS 
1-TE) (Scheme 9) did give the anticipated 6-lactone 18. As expected, erythromy- 
cin production was shut down completely. A vital control experiment was then 
carried out to establish the second point - that the relocated domain was acting 
as an active agent of chain release rather than just a passive block to further 
chain extension. A further mutant was made in which a genetically engineered 
dead copy of the TE was placed at the end of DEBS 1 . The inactivated TE differed 
from the normal enzyme only in the replacement of the key serine residue at the 
active site by an alanine; this conservative change should not alter the folding of 
the protein significantly. This control mutant did not produce erythromycin 
showing that the block to further transfer of the chain still existed. The <5-lactone 
18 was produced but in much reduced yield, demonstrating that the TE in the 
first mutant was playing an active role in chain release. 

Further examples of this strategy have since been reported by Cane et al. 
Their first experiment mirrored the design of DEBS 1-TE in that it involved 
placing the thioesterase at the end of DEBS 1. The construct was engineered in 
a different way, however, to yield a protein with a significantly different primary 
structure, so it is appropriate that it was given a slightly different name: DEBS 
1 -t TE [48]. The TE has since been relocated to the terminus of modules 5 [48], 
and 3 [49]. Products consistent with truncation of chain extension at the ex- 
pected stages were obtained from both mutants (Scheme 10). The truncation at 
module 5 was especially significant because it led to release of the hexaketide 
intermediate as a 12-membered ring macrolide 19. The truncation after module 3 
caused the formation of two tetraketide products, 21 and 22; the second is 
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possibly formed from a ketoacid 20 by decarboxylation after release from the 
enzyme. 

A potentially more versatile strategy for generating novel products is the 
transfer of domains between foreign PKS clusters. This would amount in effect 
to “spare part surgery” or “mix and match” swapping of structural residues 
between different natural products. In the first demonstration of the feasibility 
of this radical concept, sections of two different Type I PKS systems (rapamycin 
and erythromycin) were hybridised to produce a hybrid Type I PKS (Scheme 11) 
[50]. The experimental model was DEBSl-TE and the experiment involved 
replacing the AT of module 1 by an AT derived from the rapamycin PKS (this will 
be described later). The transplanted AT specifies a malonate unit as chain 
extender rather than methylmalonate. <5-Lactones 23 and 24 were produced 
lacking a methyl group at C-4, but otherwise the normal structure was main- 
tained at C-2, C-3 and C-5. The hybrid PKS therefore produced the predicted 
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The shaded AT domain is derived from module 2 of the rapamycin PKS 
(Scheme 27) and the remainder ot the hybrid PKS is as in DEBS 1-TE (Scheme 9) 

Scheme 11 



hybrid product. Other successful hybridisation experiments have been des- 
cribed [36,51]. 

The successful generation of these productive, truly hybrid, PKSs opens up 
exciting new avenues for genetic engineering of Type I PKS systems. They show 
that domains or modules drawn from different synthases can co-operate to give 
a working synthase and therefore that different synthases are probably closely 
compatible in structure. Furthermore it points the way to more ambitious and 
versatile genetic engineering experiments leading to the production of a wide 
variety of novel structures including novel antibiotics, immunosuppressants 
and other types of active compound. 

2.10 

In Vitro Experiments with DEBS Domains and Proteins 

In some of the early structural studies of the isolated DEBS proteins it was 
demonstrated that specific domains were catalytically active. For example, clea- 
vage of DEBS 1 with trypsin gave the N-terminal didomain together with two 
other fragments, comprising modules 1 and 2 respectively. It was found that the 
N-terminal didomain was specifically radio -lab died with [^^C]propionyl-CoA, 
after incubation, providing the first evidence for its proposed role as the 
“loading module” for the propionate starter unit. In contrast, modules 1 and 2 
were specifically acylated with [^^C]methylmalonyl-CoA, indicating that the 
other two acyltransferases were enzymatically active after proteolysis [41, 42]. 
Using intact DEBS proteins it was discovered that all the AT domains of the chain 
extension modules specifically catalysed the hydrolysis of (S)-methylmalonyl- 
CoA but were inert to the (K)-isomer. This was the first evidence that the (S)-iso- 
mer of methylmalonyl-CoA serves as building block for both chain extension 
modules of DEBS 1 (see later) [42]. 

In 1995 the engineered, self-sufficient PKS DEBS 1-TE was isolated in pure 
form [52] .When incubated with the appropriate building blocks, propionyl-CoA 
and methylmalonyl-CoA, and a reducing agent, specifically NADPH, the protein 
catalysed the formation of the 6-lactone triketide product 18 (Scheme 12). This 
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was the first demonstration of product formation in vitro by any modular PKS. 
Given the high state of purity it was established that the PKS could carry out its 
normal function without assistance of external proteins. The PKS showed a 
more relaxed degree of substrate specificity in its loading domain than was 
apparent in vivo. «-Butyryl- and isohutyryl-CoA esters were acceptable as 
starter acyl groups in vitro, to give novel products 25 and 26 as well as the acetyl 
and propionyl starter acyl groups found to be effective in vivo (see Scheme 12). 

A mechanistically important result to emerge from this study was the 
rigorous determination of the stereochemistry of the methylmalonate units 
used for chain extension in modules 1 and 2. This was of interest because the two 
branching methyl groups in the lactone product have opposite stereo- 
chemistries. In principle it was possible that either the (K)-isomer of the methyl- 
malonate was incorporated at one site and the (S) -isomer at the other by a 
common incorporation mechanism, or that a single enantiomer was incorpora- 
ted at both sites in which case the two modules would have to follow different 
sequences of reactions. It was unambiguously shown that the ( S)-isomer served 
for both chain extensions. It is unlikely, therefore that altered products with 
different stereochemistry will be generated by transferring AT domains and 
some other means will have to be devised to achieve this desirable end. 

Similar results have come from studies of the DEBS 1 +TE system in vitro [53]. 
When DEBS 1 + TE was incubated with appropriate precursors in the absence of 
the reducing agent NADPH the keto 27 and the pyrone 28 analogues of the 
triketide lactone were obtained (Scheme 13) [53, 54]. This latter result resembles 
the similar formation of a pyrone product by the animal EAS and the 6-methyl- 
salicylic acid PKS under similar conditions [55]. Clearly, the jS-ketoester inter- 
mediates can be transferred from one module to the next without further pro- 
cessing, even in modules which are geared to the production of a hydroxyester 
intermediate. 

Other significant in vitro studies include the demonstration that a cell free 
extract of all three DEBS proteins expressed heterologously in S. coelicolor was 
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Scheme 13 



capable of synthesising 6-deoxyerythronolide B 7, albeit with rather low 
efficiency [56]. The engineered protein module3+TE has been purified to 
homogeneity [57]. In collaboration with a pure extract of DEBS 1 this produced 
the tetraketide products 21 and 22 (see Scheme 10). It maybe possible to use one 
of these systems to catch the two cassettes in contact as the developing chain is 
passed from one to the other by treatment with appropriate cross-linking 
reagents. Unfortunately, attempts so far have not met with success and the 
nature of cassette docking in multi-cassette PKS systems remains a mystery. 

The substrate specificity of the thioesterase domain is of crucial concern in 
the quest for a rational basis for genetic engineering of PKSs to produce novel 
products: it would be futile to engineer a PKS system to produce a particular 
novel product if there is no effective mechanism for its release from the final 
AGP. The didomain from the end of DEBS 3 was over-expressed in E. coli to give 
milligram amounts of protein [58]. The resulting protein was shown by electro- 
spray mass spectrometry to have the correct molecular weight for the didomain 
with an apo-ACP in which the phosphopantetheine group had not been added. 
This deficiency did not matter as far as the planned studies of the TE specificity 
were concerned. In the preliminary work it was shown that the protein was able 
to bind p-methylphenylsulphonyl fluoride (PMSF), a standard inhibitor of 
chymotrypsin-like enzymes, to the hydroxyl of a specific serine residue located 
in the putative active site of the enzyme. This evidence strongly supported a 
proposed cleavage mechanism involving initial transfer of the developed acyl 
chain from the thiol of the neighbouring AGP to the hydroxyl group of an 
active serine residue on the TE (Scheme 14). The resulting oxygen ester is then 
cleaved by attack of a suitable nucleophile to release the product. In forming a 
macrolide ring (the presumed normal role of the thioesterase), the nucleophile 
would be the appropriate hydroxyl group at the distant terminus of the poly- 
ketide chain. 

To assess its substrate specificity, the thioesterase was next challenged with 
synthetic acylesters derivatised as p-nitrophenyl and M-acetylcysteamine (NAG) 
esters (exemplified in Scheme 15) [59]. A wide range of structurally varied sub- 
strates with both leaving groups was cleaved. By electrospray mass spectrometry 
it was shown that acyl enzyme intermediates were formed as predicted. The 
resulting acyl enzyme intermediates could be cleaved either by water to give the 
corresponding acid, or by an added alcohol such as ethanol to give an ester. None 
of the substrates tested gave any macrocyclic lactone even when there was a 
hydroxyl group elsewhere in the chain at a suitable distance from the acyl 
terminus. The work established that the TE has alternative mechanisms for 
product release (hydrolysis or ester formation) when lactonisation is not pos- 
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X = p-nitrophenyl or SNAC 

Scheme 15 




1.0: 1.5 



sible. Information from studies such as these can guide genetic engineering 
studies towards productive goals and away from experiments doomed to failure 
because of incompatibility of the TE domain with the target novel polyketide 
product. 

2.11 

Summary 

The erythromycin PKS provides the structural and mechanistic paradigm 
against which other synthases will be compared both in vitro and in vivo. Equally 
important are the impressive genetic engineering technologies which have been 
developed for modification of this synthase and which should be more widely 
applicable to other synthases. Despite these rapid advances, there remain 
enormous gaps in our understanding of the structure and function of these 
challenging systems. 
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3 

The Avermectins 

The naturally occuring avermectins are a family of eight, pentacyclic structures 
containing a sixteen-membered macrolide ring which are produced by Strep- 
tomyces avermitilis (Fig. 6) [60]. These eight compounds differ in structure due 
to the variability of PKS starter unit at C-25, the 0-methylation pattern at C-5, 
and the hydration-dehydration pattern at C-22/C-23. Other closely related natu- 
ral products include the milbemycins and the nemadectins. The avermectins 
exhibit potent broad spectrum antiparasitic activity, and the semi-synthetic 
Ivermectin 29 (hydrogenated avermectin Bl) displays even greater efficacy 
[60,61]. 

Feeding studies have shown the avermectin aglycone to be derived from 
seven acetate and five propionate units [62]. The isobutyrate and 2-methylbuty- 
rate starter units derive from catabolism of L-valine and L-isoleucine respective- 
ly [63]. [^®0]Acetate and propionate feedings have shown that all the oxygen 
atoms other than those attached to C-6 and C-25, are derived from their 
carboxylate precursors [62] . The C-25 oxygen is most probably derived from the 
starter unit and that of the furan from molecular oxygen. The three methoxy 
groups derive from L-methionine and the oleandrose units from glucose [63, 64] . 

At the genetic level a 95 kb gene cluster has been cloned and demonstrated to 
be responsible for avermectin biosynthesis (Fig. 7) [65]. Sequencing of the 
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cluster revealed structural organisation for a type I PKS with strong similarity 
to that of the erythromycin PKS. A central 70 kb fragment contains two con- 
vergent transcripts, aveAI and All, which appear to encode multifunctional 
proteins with six modules present in each transcript. The first module of aveAI 
demonstrates the presence of a loading didomain and first extension module as 
observed for the erythromycin PKS. Further genes responsible for post PKS 
modifications and regulation both flank and occur between the two PKS 
genes. 

The range of naturally occurring avermectins has been extended by utilising 
a “mutasynthesis” approach [66]. The S. avermitilis blocked mutant employed 
lacked a branched chain decarboxylase activity which is responsible for pro- 
duction of the PKS starter acids. Feeding of the “natural” starter acids restored 
the production of natural avermectins, whereas some 40 “unnatural” starter 
acids led to many novel avermectin analogues modified at C-25. Doramectin 30, 
an extremely effective antiparasitic treatment marketed as Dectomax, was iso- 
lated after feeding cyclohexane carboxylic acid 31 (Scheme 16). The NAC thio- 
ester analogue 32 of the diketide biosynthetic intermediate was also incorpora- 
ted into 30 using this mutant strain (see Scheme 16) [67]. This example, and 
those for the tetronasin producer discussed later, are the only examples to date 
of the incorporation of an analogue of a natural PKS-bound precursor. 

The avermectin PKS loading module has been used to generate a hybrid PKS 
system by replacing the loading module of DEBS 1-TE. The new hybrid PKS 
produced new hybrid polyketide (triketide lactones) which incorporated the 
isobutyrate and 2-methylbutyrate starter acids of avermectin biosynthesis, as 
well as the normal acetate and propionate starter units of erythromycin bio- 
synthesis [51]. 

4 

Tylosin 

Tylosin 33 from Streptomyces fradiae is a representative member of the large 
family of 16-membered macrolide antibiotics commonly utilised in veterinary 
medicine. The aglycone core, tylactone 34 is constructed from two acetate, five 
propionate and one butyrate units [68, 69]. The processive mechanism for its 
biosynthesis was demonstrated by the pioneering contributions of Hutchinson 
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and co-workers who demonstrated the incorporation of NAC thioesters of 
di- and triketide intermediates (Scheme 17) [24]. Further supporting evidence 
comes from the isolation of partially elaborated intermediates of chain elonga- 
tion in mutants of S.fradiae [70]. 

The gene cluster for tylosin biosynthesis is known [71], and PKS encoding 
regions for the biosynthesis of both tylactone and the structurally very simi- 
lar compound platenolide 35 in Streptomyces ambotaciens have been discus- 
sed [36]. The gene organisation, but not the sequence was published. The 
platenolide PKS was reported to consist of five ORFs which encode two bimo- 
dular and three monomodular multifunctional proteins. The reported domain 
sequence within the modules matches that required for the platenolide agly- 
cone synthesis, and the first protein contains a loading module consisting of 
a KS-AT-ACP tridomain. This information was utilised in the construction of 
a hybrid PKS consisting of the tylactone PKS loading module in place of that 
of the platenolide PKS. The mutant strain carrying this new hybrid PKS was 
shown to produce the new hybrid polyketide 16-methyl platenolide 36 as 
expected. 
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5 

Nargenicin A and the Solanapyrones A and B 

Certain cyclic moieties present within some polyketide structures are con- 
sidered to occur via an intramolecular Diels-Alder reaction. Such a process has 
been proposed for several metabolites, of which nargenicin A and the solanapy- 
rones A and D are the most thoroughly investigated. 

Feeding studies to Nocardia argentinesis with labelled NAC thioesters of 
di-, tri-, tetra- and pentaketide intermediates demonstrated regiospecific in- 
corporations into nargenicin 37 (Scheme 18) which are consistent with an 
intramolecular Diels-Alder cyclisation of a linear nonaketide intermediate 
(Scheme 19) [72,73]. 




NHAc 



Scheme 18 





Scheme 19 
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The probable intermediacy of a Diels-Alderase in the biosynthesis of the 
solanapyrones A and D 38 and 39 in Alternaria solani was demonstrated by the 
incorporation of a deuterium labelled isomer (H * = D) of pro-solanapyrone II 
40a (which is first oxidised to pro-solanapyrone III 40b), yielding predomi- 
nantly the exo isomer 38 as a single enantiomer (Scheme 20) [74]. Spontaneous 
cyclisation of 40b in aqueous solution gave predominantly endo isomer 39, 
which was of course racemic. These results have been reinforced by a study 
which investigated a cell free extract of A. solani [75]. Incubation of 40b with 
this extract under specific conditions gave 25% yield of 38 and 39 in an 
exolendo ratio of 53:47. Control experiments using both denatured enzyme 
(10% conversion) and the absence of cell free extract (15% conversion) gave 
38 and 39 in an exolendo ratio of 3:97. This allowed calculation of a 15% en- 
zyme catalysed yield of 38 and 39 with an exolendo ratio of 87:13, which was 
further shown to be > 92% ee for 38. That the biologically observed exo selec- 
tivity cannot be achieved by chemical means alone is remarkable, and the 
combined in vivo and in vitro results leave little doubt as to the presence of a 
Diels-Alderase, although such a protein has not been characterised. 

6 

Polyether Antibiotics 

Polyether metabolites are ionophoric antibiotics which have their major 
applications in the area of animal husbandry. The distribution of oxygen 
atoms in these structures predisposes them towards the complexation of 
alkali metal ions, and their mode of action is considered to depend upon the 
ability to disrupt the sodium-potassium ion balance across cell membranes. 
Monensin A from Streptomyces cinnamonensis and tetronasin from Strepto- 
myces longisporoflavus are extensively studied examples of these meta- 
bolites. 
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6.1 

Monensin A 

Monensin A 41 is derived from five acetate, seven propionate and one butyrate 
units, with an extra 0-methyl group derived from L-methionine [76, 80]. The 
NAG thioester of a postulated triketide chain elongation intermediate has been 
incorporated intact into monensin A by Patzelt and Robinson. This experiment 
required the extremely careful use of several p-oxidation inhibitors and other 
additives to minimise precursor degradation [77]. 

The most intriguing question which remains concerning the biosynthesis of 
polyether antibiotics such as monensin A is that of the mechanism by which the 
series of ether rings are formed. In 1965 Westley et al. proposed a mechanism for 
the biosynthesis of lasalocid and isolasalocid acids from a common diepoxide 
precursor which differed only in the regiochemistry of the cyclisation process 
[78]. In 1983 Cane and co-workers extended this idea to monensin A biosynthe- 
sis in which an acyclic (post PKS) triene precursor 42 would be converted into 
triepoxide 43, mediated presumably by a cytochrome P-450 system, and then a 
series, or cascade, of intramolecular epoxide ring openings forms, after a final 
oxidation step, monensin A 41 (Scheme 21) [79]. Indirect evidence to support 
this hypothesis was demonstrated by [^®0]carboxylate and [^®0] -molecular 
oxygen feeding experiments which showed that the oxygen atoms at the postu- 
lated epoxide-derived positions (^®0) are derived from molecular oxygen [80]. 




Scheme 21 
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Biomimetic studies of this proposed cascade have provided further tantalising, 
albeit incomplete, supporting evidence [81]. Feeding studies with a NAC thio- 
ester of the required (£,£,£)-triene intermediate 42 in labeled form have so far 
been unsuccessful, although this may well be due to problems of transport 
across the cell membrane and degradation by p-oxidation pathways [82]. 

More recently an alternative biosynthetic pathway has been proposed by 
Townsend and Basak (Scheme 22) [83]. This alternative model involves the 
iterative syn-oxidative cyclization of a (Z,Z,Z)-triene intermediate 44 in contrast 
to the (£,£,£)-triene intermediate 42 of the epoxide cascade mechanism. The 
putative, alkoxy-linked, non-heme metal-oxo species 45 undergoes a [2-1-2] 
cycloaddition to yield the metalloxetane intermediate 46. Reductive elimination 
then results in an overall syn-addition of the two oxygen atoms 47. Oxidation of 
the metal intermediate 47 to 48 and two further [2 -I- 2] cycloaddition-reductive 
elimination-metal oxidation cycles would result, after final hydroxylation, in 
monensin A 41. 

In model studies McDonald and Towne have shown that oxochromium 
reagents can oxidise model (Z)- and (£)-hydroxydienes 49 and 50 to yield 



OH 







Scheme 22 
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49 




50 



Scheme 23 



PCC 




bis-furans (Scheme 23) [84]. The formation of 51 and 52 from the (Z)-hydroxy- 
diene, and 53 and 54 from the (£)-hydroxydiene provide support for the syn-oxi- 
dative cyclisation mechanism. This work has been extended to ReO 3 reagents for 
the formation of a tris-furan system which is a core component of another poly- 
ether metabolite, gonianin [85]. 



6.2 

Tetronasin 

Tetronasin 55 is derived from seven acetate and six propionate units and a single 
L-methionine derived 0-methyl group [86]. The origin of C-33 and C-34 remains 
obscure. Feeding experiments with NAG thioesters of di-, tri- and tetraketide 
chain elongation intermediates have been performed and the successful 
approach involved feeding to senescent (late) cells in which p-oxidative degra- 
dative pathways were minimal [87, 88]. It is worth noting at this point that in all 
current examples of feeding experiments with NAG thioesters, the proposed 
intermediate is only incorporated intact at the correctly predicted extension 
module, indicating some molecular recognition factor inherent to this process. 
This was extended in the study of tetronasin biosynthesis where all four possible 
diastereoisomers of the proposed tetraketide NAG intermediates were prepared 
and fed. Only the ‘correct’ diastereoisomer was incorporated intact [88]. These 
results are consistent with recent in vitro studies which demonstrate that the 
erythromycin diketide NAG intermediate specifically acylates the jS-keto- 
synthase/acyl carrier protein domain of module 2 in DEBS 1 + TE [89]. 

In further studies several unnatural analogues of di-, tri- and tetraketide 
chain elongation intermediates have been incorporated into tetronasin ana- 
logues 55 a- e (Scheme 24) [90]. These analogues contained either fluorine 
atoms in replacement of the a-proton or ethyl in place of methyl side chains, 
or a combination of the two. Isopropyl and benzyl side chains were not incorpo- 
rated. The incorporation levels of 0.1 -1.0% were significant, and those ana- 
logues which contained a-fluorine substitutions generally gave the highest 
incorporation levels. This substitution with fluorine has negligible stereo- 
chemical significance, and was designed to inhibit the a, p-dehydrogenation 
step of the p-oxidation degradative pathway. 

A novel sodium ion templated polyene cyclisation reaction has been propos- 
ed for the biosynthesis of tetronasin, based on biomimetic studies [91]. This 
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55 Ri = H, R2 = R3 = R, = Me (Tetronasin) 
55a Ri = F, Rj = R3 = R4 = Me 
55b R, = F. R2 = Me, R3 = El, FI, = Me 
55c R, = H, Rj = B, R3 = R, = Me 
55d R, = F, R 2 =Ra = Me, R4 = Et 

Scheme 24 55e R, = H, R2 = Me, R3 = El, FI, = Me 



biomimetic approach utilised a base-catalysed cascade which established two 
rings and four stereogenic centres in a model system closely related to a putative 
tetronasin precursor. The product obtained was different from the required 
structure only in the stereochemistry at C-4. 

7 

Ansamycin Antibiotics 

The extensively studied rifamycin B 56 from Amycolatopsis mediterranei and 
ansatrienin A 57 from Streptomyces collinus exemplify this class of polyketide 
antibiotics. A common structural feature of these metabolites is the “ansa” 
bridge system of mixed (acetate/propionate) polyketide origin which is initiated 
from a “mCyN” PKS starter unit. The presence of a (biosynthetically unique) 
mCyN unit is recognised by a six-membered carbocyclic ring carrying the extra 
carbon and nitrogen atoms in a meta arrangement. This structure is generally 
quinonoid or benzenoid in nature, and found either with minimal further 
modification, as in 57, or as part of a naphthalenic structure, as in 56. 



o 
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Scheme 25 



Evidence for the initial steps of rifamycin B biosynthesis can be seen in the 
accumulation of a “tetraketide” chain elongation intermediate P8/1-0 58 from a 
mutant of A. mediterranei [92]. This evidence and the demonstration from 
feeding experiments that the carbons on either side of the ether link in the ansa 
bridge derive from the same propionate unit [93] indicate that this ether moiety 
is formed after the initial biosynthesis of a fully extended polyketide chain. 

The specific and proximate precursor of the mCyN unit in ansamycin poiy- 
ketides is 3-amino-5-hydroxybenzoic acid 59 (AHBA) [94]. The biosynthesis of 
AHBA has recently been described by Floss and co-workers from the initial 
branch point of the shikimic acid pathway prior to 3-deoxy-D-arahino-heptulo- 
sonic acid 7-phosphate (DAHP) [95]. The pathway shown in Scheme 25 was 
delineated by feedings of the proposed AHBA precursors, in labelled forms, to 
cell-free extracts of both the rifamycin B producer A. mediterranei S699 and the 
ansatrienin A producer S. collinus Till 892. In these experiments each of the com- 
pounds 61-64 was converted into AHBA with generally increasing efficiency. 
Most importantly the shikimate pathway compound DAHP cannot replace 
phosphoenolpyruvate 61 and erythrose 4-phosphate 60, or aminoDAHP 62 as 
the precursor of AHBA 59. 

An mCyN unit is also present in the core of the polyketide antibiotic 
asukamycin from Streptomyces nodosus subsp. asukaensis [96]. This has been 
shown to arise not from a variant of the shikimate pathway, but from the 
condensation of a C 4 unit from the TCA cycle, closely related to succinate, with 
a C 3 unit, possibly dihydroxyacetone phosphate, from the triose pool. Related 
studies concerning 3-amino-4-hydroxybenzoic acid biosynthesis in Strepto- 
myces murayamaensis mutants MC2 and MC3 support this hypothesis [97]. 

The macrocyclisation or extra chain attachment to the mCyN unit of these 
metabolites leading to the formation of an amide bond, is presumably catalysed 
by an amidase activity. Floss has reported the possible identification of a gene 
resposible for such an enzyme in the gene cluster for rifamycin B biosynthesis in 
A. mediterranei [98]. 

The ansamycin polyketide ansatrienin A 57 also contains a fully reduced 
cyclohexane carboxylic (CHC) unit attached to the macrocyclic structure via a 
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D-alanine residue. Other polyketides which contain such an arrangement are the 
trienomycins [99], while similar CHC chain terminating (synthase starter) units 
are found in a branch of asukamycin [96], and in the co-cyclohexyl fatty acids of 
certain thermophilic bacteria [100]; substituted CHCs are also found as PKS 
starter units in the rapamycin family of polyketides (see Sect. 8). The cyclohexyl 
moieties of these compounds have been demonstrated to derive from the shiki- 
mate pathway. 

Almost the entire pathway from shikimic acid 65 to CHC 70 has been 
delineated by experiments with the ansatrienin A producer S. collinus Tii 1892 
[101]. All putative intermediates for this pathway were prepared in labelled 
form and fed to the producing organism and the 57 was isolated and was 
characterised by NMR spectroscopy. Based on these results the pathway shown 
in Scheme 26 was proposed. The initial step is a 1,4-conjugate elimination of 
the hydroxyl group at C-3 and a proton at C-6 which gives rise to a cross-con- 
jugated dihydroxy diene 66. This then undergoes reduction of the double bond 
conjugated with the carbonylgroup. A further 1,4-conjugate elimination of the 
C-4 hydroxyl group and C-1 proton gives 67. Reduction to 68 is followed by a 
further reduction and dehydration to 69. The final steps involve isomerisation 
to bring the remaining double bond into conjugation and reduction to yield 
70. It is especially noteworthy that in this pathway the sequence of dehydra- 
tions and double bond reductions ensures that no intermediate is ever aro- 
matic. 

Comparable studies have been performed for the formation of co-cyclo- 
hexyl fatty acids in Alky dob acillus addocaldarius and the pathways are iden- 
tical [100]. A recent publication concerning this later pathway has shown that 
the final remaining stereochemical ambiguity, the stereochemistry of proton 
loss at C-6 in the initial 1,4-conjugate elimination of shikimate occurs with 
loss of thepro-6R proton [102]. This mirrors the stereochemistry of “normal” 
shikimate metabolism in the formation of chorismate from 5-enolpyruvyl- 
shikimate 3-phosphate. 
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8 

Rapamycin, FK506 and FK520 

The structurally related compounds rapamycin 71, FK506 72 and FK520 73 
demonstrate antitumour, antifungal and immunosuppressant activities. This 
latter immunosuppressant activity has generated great interest due to applica- 
tions in the therapeutic area of organ transplant surgery, and a detailed model 
for their mode of action has been developed [103]. Due to the similarity of their 
structures and the parallel biosynthetic studies, rapamycin will be the focus of 
this section and cross reference will be made where appropriate. 




Rapamycin obtained from Streptomyces hygroscopicus subsp. hygroscopicus is 
biosynthesised from seven acetate and seven propionate units, with 0-methyl 
groups derived from L-methionine [104]. Competitive incorporation studies 
with radiolabelled precursors demonstrated that the pipecolate ring is derived 
from lysine via free pipecolate [105]. The fra ns-dihydroxycyclohexane carbo- 
xylic acid (DHCHC) ring is derived from shikimate [106], and recent studies 
have shown that free DHCHC can act as a precursor of the cyclohexyl unit [107]. 
Related experiments with Streptomyces hygroscopicus subsp. yakushimaensis 
have elaborated some of the steps involved in the formation of the cyclohexyl 
starter unit for FK520 [108]. The early steps closely resemble the biosynthesis 
of CHC in S. collinus as previously described. Shikimic acid undergoes a 
1,4-conjugate elimination of the C-3 hydroxyl and C-6 hydrogen followed by 
reduction, double bond isomerisation and finally reduction to the saturated 
DHCHC. 

The entire biosynthetic gene cluster for rapamycin biosynthesis has been 
sequenced and published [35]. The PKS genes were identified by hybridisation 
with DNA from the PKS genes for erythromycin biosynthesis. Sequencing 
beyond the PKS region identified other genes that are predicted to be involved 
in the late stages and regulation of rapamycin biosynthesis [109, 110]. The 
sequence and organisation of a gene encoding a four-module PKS protein which 
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Scheme 27 
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is responsible in part for the biosynthesis of FK506 in Streptomyces tsukabaen- 
sis has also been published [111]. 

As expected the PKS for rapamycin showed a Type I organisation strongly 
reminiscent of the erythromycin PKS, with catalytic activities arranged in 
modules (Scheme 27) and with sets of modules housed in turn in three multi- 
modular cassettes designated RAPS 1, RAPS 2 and RAPS 3. RAPS 1 contains 
modules 1 to 4, RAPS 2 modules 5 to 10, and RAPS 3 modules 11 to 14. The 
domain structure of the rapamycin PKS may not correspond in every detail to 
the pattern expected from the proposed structure for the PKS product however. 
In modules 3 and 6, there appear to be potentially active KR and DH domains 
which are not required; module 3 also contains a potentially active but functio- 
nally redundant ER domain. It is possible that the active sites of these extra 
domains have been inactivated in a way that is not apparent from the primary 
sequence, and that the now redundant protein residues have still to be edited out 
by the random processes of evolution. There is also a chance that all these 
domains are indeed active and that the true rapamycin PKS product is more fully 
reduced than that shown. Extra post-PKS reoxidations would then be required to 
reintroduce the oxygen functionality at the relevant sites in the final structure. 

The loading module comprises three domains. The first (CL) shows homo- 
logy to ATP-dependent carboxylic acid-CoA ligases, the second is a putative 
enoyl reductase (ER) and the third an AGP. The probable sequence of operations 
starts with the enoic acid 74 derived from shikimic acid which is reduced by the 
ER domain. The first domain will activate the carboxylic acid to an active acyl 
derivative ready for transfer to the thiol residue of the AGP. The final saturated 
product will end up attached to the AGP as a thioester derivative ready for trans- 
fer to the KS domain of the first chain extension module. The timing of the 
reduction in this sequence of operations cannot be predicted. 

Chain termination in rapamycin biosynthesis is not performed by a thio- 
esterase as in the erythromycin PKS, but is thought to be effected by a speciali- 
sed multidomain protein coded by the gene rapP [109]. This gene has strong 
similarity to genes involved in non-ribosomal peptide biosynthesis [112] and 
the corresponding protein is believed to catalyse the formation of the ester and 
amide bonds to pipecolic acid (Scheme 28). The final product of the PKS is 
coupled to the nitrogen of the pipecolate unit, and the carboxyl group of this 
unit is esterified with the first free hydroxyl at the remote end of the polyketide 
chain to yield the first enzyme free intermediate 75. The order of these steps is 
not known. It is this key multidomain protein, designated the pipecolate incor- 
porating enzyme (PIE), that provides the pivotal link between polyketide and 
polypeptide biosynthesis. Isolation and characterisation studies have recently 
demonstrated that this enzyme is monomeric [113]. 

An enzyme has been isolated from the FK520 producer which is believed to 
be the key one responsible for inserting pipecolic acid into the macrocycle [114]. 
It is reported to be dimeric and activates pipecolic acid and several structural 
analogues in an ATP-dependent reaction to give an enzyme-bound amino-acyl 
adenylate. There is evidence that this then reacts to form a thioester linkage to 
the enzyme. This mechanism of activation is the same as that found in the non- 
ribosomal biosynthesis of peptide natural products such as gramicidin [112]. 
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9 

Aliphatic Polyketides from Fungi 

While the majority of polyketides derived from fungi are aromatic in nature, 
there are many examples of aliphatic polyketides which have enjoyed a great 
deal of study in recent years. Before discusing some of these in detail, however, 
it is worth considering the origin of carbon extender units required for these 
pathways. 



9.1 

Origin of the Carbon Atoms 

The extender units of the polyketide extension process in fungi are always 
acetate derived. Intact propionate incorporation has been reported only for the 
starter acid of a few metabolites. The pendant methyl groups which are observed 
in many metabolites have been demonstrated to arise from the terminal 
S-methyl group of L-methionine via S-adenosyl methionine (SAM). 

An example of such a polyketide is cubensic acid 76 from Xylaria cuhensis 
[115]. From an initial viewing of this structure it could be considered to derive 
from a mixed acetate/propionate pathway. Feeding studies however clearly 
demonstrate that the underlying carbon chain is derived from eleven acetate 
units with all of the eight methyl branches having their origin in L-methionine 
(Fig. 8). 



9.2 

Modularity of the Methylation Process 

The mechanism by which methyl branches are incorporated into these com- 
pounds is still unclear. It is unlikely that a single discrete methyltransferase 
catalyses this process due to the quite different structure at each site methylated. 
It is possible that a series of discrete enzymes could each catalyse a single methyl- 
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Fig. 8. Biosynthetic origin of the carbon atoms and proposed modular assembly of cubensic 
acid 



ation, although the large number of individual recognition events required 
make this unlikely. Furthermore, many such methyl groups occur at reduced 
methylene positions for which an enzymic alkylation process utilising SAM is 
unreasonable. 

A more satisfying proposal is that methyl transferase activities are present 
within the polyketide synthase, which catalyses the formation of carbon-carbon 
bonds at activated methylene groups of the jS-keto thioester intermediates. This 
idea has been postulated by O’Hagan and co-workers, who have predicted a 
domain and module sequence of the putative PKS responsible for cubensic acid 
76 biosynthesis (see Fig. 8) [115, 116]. 

Evidence for such a modular pathway has been provided from studies into 
the biosynthesis of the polyketide backbone 77 of (4P)-4-[(£)-2-butenyl]-4- 
methyl-L-threonine 78 which is incorporated into cyclosporin A in Tolypo- 
cladium niveum [117]. The proposed biosynthesis of 77 is presented in 
Scheme 29. In vitro studies using a cell extract have verified unambiguously 
that the biosynthetic mechanism is processive, that the first PKS free inter- 
mediate is the tetraketide 79, and that methylation unequivocally occurs at the 
stage of the enzyme bound 3-oxo-4-hexenoic acid thioester 80 which is the 
triketide product from the second elongation cycle. These and other results 
indicate that the methyl transferase activity is inherent in the second module 
of the putative PKS. 

At the genetic level preliminary evidence has been reported for a type I PKS 
implicated in the biosynthesis of the of the triol moiety of lovastatin (mevino- 
lin) produced by Aspergillus terreus. This describes the presence of a methyl- 
transferase domain within the PKS sequence [118]. 



9.3 

Evidence for a Processive Mechanism: Feeding Studies 

The biosynthesis of fungal aliphatic poiyketides by a processive mechanism is 
further demonstrated by the successful incorporation of chain elongation inter- 
mediates as their NAC thioesters in several experiments [119, 120]. These 
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experiments and the work of his own research group in this area are discussed 
in greater detail by Tom Simpson in Chap. 1. 



9.4 

The Epothilone Antibiotics 

Worthy of a final note are the recently discovered epothilones. This family of 
compounds which has been isolated from several strains of Sorangium cellu- 
losum [121] has attracted significant attention due to its potent biological activi- 
ty and exciting mode of action [122]. Epothilones display narrow spectrum 
antimycotic activity against Mucor hiemalis, but are extremely cytotoxic 
towards animal cells. The mode of action is similar to that of taxol in stabilising 
microtubules during mitosis. In fact the epothilones are capable of displacing 
taxol from the requisite cellular binding site and are several orders of magnitude 
more active against multiply resistant cell lines. Epothilone B 81 which contains 
a methyl group attached to the epoxide ring has been shown to possess signifi- 
cantly greater activity than epothilone A 82. 

While biosynthetic studies have yet to report on the epothilones, their struc- 
ture is presumably elaborated from a polyacetate pathway with extra pendant 
methyl groups added from L-methionine; the structure also contains an 
interesting gem dimethyl grouping at C-4. The unusual thiazole starter unit can 
be predicted to arise either from cysteine directly with post-PKS addition of a C 2 
unit (probably acetate) and cyclisation, or from an W-acetylcysteine, or from a 
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81 R = CH 3 , Epothilone B 

82 R = H, Epothilono A 



Starting thiazole structure prederived from cysteine and acetate. The impressive 
cytotoxicity of these structures has given rise to a flurry of interest in their total 
synthesis which was first performed for both 82 and 81 by Danishefsky and 
co-workers [123, 124]. 
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1 

Introduction 

The range of functionality provided by the 20 amino acids found in proteins 
consists of weak acids and bases, nucleophiles, hydrogen bond donors and 
acceptors, and the redox active thiol/disulfide. This limited range of chemistry is 
inadequate for the catalysis of many reactions found to occur in biological 
systems. Therefore, a variety of small organic molecules, called cofactors, co- 
enzymes, or vitamins, have evolved to broaden the limited range of chemistry 
that can be catalyzed by simple proteins. 
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The chemistry of the cofactors has provided a fertile area of overlap between 
organic chemistry and biochemistry, and the organic chemistry of the cofactors 
is now a thoroughly studied area. In contrast, the chemistry of cofactor bio- 
synthesis is still relatively underdeveloped. In this review the biosynthesis of the 
cofactors shown in Fig. 1 will be described. Heme and cobalamin will be omit- 
ted as these are covered in the review by Allan Battersby and Finian beeper in 
this volume. We will focus on cofactor biosynthesis in Escherichia coli because 
the relevant genetics and biochemistry have been most intensively studied in 
this organism and it is the easiest system in which to carry out molecular biology 
[1]. Enzymes from other sources will be described only if the corresponding 
enzyme from E. coli has not been isolated or subjected to mechanistic studies. 
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Fig. 1. The cofactors covered in this revietv 
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Molybdopterin Guanine Dinucleotide 
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Fig. 1 (continued) 



Even though E. coli is a very well-studied bacterium, many interesting 
mechanistic problems in cofactor biosynthesis in this organism remain un- 
solved. The mechanisms for the formation of the nicotinamide ring of NAD "^.the 
pyridine ring of pyridoxal, the pterin system of molybdopterin, and the thiazole 
and pyrimidine rings of thiamin are unknown. The sulfur transfer chemistry 
involved in the biosynthesis of lipoic acid, biotin, thiamin and molybdopterin is 
not yet understood. The formation of the isopentenylpyrophosphate precursor 
to the prenyl side chain of ubiquinone and menaquinone does not occur by the 
mevalonate pathway. None of the enzymes involved in this alternative terpene 
biosynthetic pathway have been characterized. The aim of this review is to focus 
attention on these unsolved mechanistic problems. 

2 

Nicotinamide Adenine Dinucleotide Biosynthesis 

Nicotinamide adenine dinucleotide (NAD"^ and NADP"^) is the biochemical 
hydride donor/acceptor [2]. 



2.1 

Biosynthetic Pathway 

The biosynthesis of NADIP)"^ in E. coli is outlined in Fig. 2 [3]. Oxidation of 
aspartic acid to the imine 14 followed by condensation with dihydroxyacetone 
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Fig. 2. The nicotinamide adenine dinucleotide biosynthetic pathway 



phosphate gives quinolinic acid. Ribosylation, followed by decarboxylation, 
adenylation and amide formation, completes the biosynthesis of NAD"^. An 
additional phosphorylation gives NADP"^. 



2.2 

Enzyme Overexpression and Purification 

The current status of the overexpression and purification of the nicotinamide 
adenine dinucleotide biosynthetic enzymes is summarized in the table [3] . 
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Gene 


Sequenced? 


Overexpressed? 


Purified? 


Other source [4] 


Reference 


nadB 


yes 


yes 


yes 




5-7 


nadA 


yes 


yes 


no 


no 


6.8 


nadC 


yes 


yes 


yes 




9 


nadD 


no 


no 


no 


no 


3 


nadE 


yes 


yes 


yes 




10. 11 


nadF 


no 


no 


no 


no 


12 



2.3 

Mechanistic Highlights 



2.3.1 

Aspartate Oxidase 

A mechanistic proposal for this flavoenzyme, analogous to the mechanism of 
monoamine oxidase [13], is outlined in Fig. 3. No mechanistic studies have been 
described. 



2.3.2 

Quinolinate Synthase 

This enzyme has been overexpressed but not purified due to instability. The 
mechanism for the formation of quinolinic acid is unknown. A proposal is out- 
lined in Fig. 4. 



2.3.3 

Quinolinate Phosphoribosyl Transferase 

The mechanism for this reaction has not been studied. A proposal is outlined in 
Fig. 5 [9]. 
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Fig. 3. Mechanistic proposal for aspartate oxidase 
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HO OH 
18 



Fig. 5. Mechanistic proposal for quinolinate phsophoribosyl transferase 



3 

Riboflavin Biosynthesis 

Flavin containing cofactors are chemically versatile and participate in oxygen 
activation and in 1- and 2-electron transfer reactions [14]. In its biochemically 
active forms, flavin occurs as flavin mononucleotide (FMN) and flavin adenine 
dinucleotide (FAD). 
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3.1 

Biosynthetic Pathway 

The biosynthesis of FAD is summarized in Fig. 6 [15, 16]. Loss of C-8 and 
pyrophosphate from guanosine triphosphate gives 36. Deamination of the 
pyrimidine, ring opening of the ribose, reduction, and dephosphorylation gives 
39. Condensation of this with 40 gives lumazine 41. Two molecules of lumazine 
undergo a remarkable coupling reaction to give riboflavin. Phosphorylation 
followed by adenylation gives FAD. 

3.2 

Enzyme Overexpression and Purification 

The current status of the overexpression and purification of the riboflavin bio- 
synthetic enzymes is summarized in the table [15, 17, 18]. 



Gene Sequenced? Overexpressed? Purified? Other source [4] Reference 



bA 


yes 


yes 


yes 




19 


bD 


yes 


yes 


yes 




20 


bB 


yes 


yes 


yes 




21 


bE 


yes 


yes 


yes 




22, 23 


bC 


yes 


yes 


yes 


yes 


24 


bF 


yes 


no 


no 


yes 


15. 25 
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Fig. 6. The flavin adenine dinucleotide biosynthetic pathway 
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Fig. 7. Mechanistic proposal for GTP cyclohydrolase II 



3.3.2 

Pyrimidine Reductase 

A mechanistic proposal for this reaction is outlined in Fig. 8. The pro-S proton 
at C-1 of 38 is derived from nicotinamide. This excludes an alternate mechanism 
involving an Amadori rearrangement [26]. 



3.3.3 

3,4-Dihydroxy-2-butanone-4-phosphate Synthase 

A postulated mechanism for the conversion of 47 to 40, which is consistent with 
extensive labeling studies carried out on the enzyme isolated from Candida 
guilliermondi, is outlined in Fig. 9 [27, 28, 29, 30]. 



3.3.4 

6,7-Dimethyl-8-ribityl Lumazine Synthase 

Mechanistic [31] and structural [32] studies have been carried out on the enzyme 
isolated from B. subtilis, which exists as a large complex. The jS-subunits 
catalyze the lumazine synthase reaction and are arranged as an icosahedral shell 






Fig. 8. Mechanistic proposal for pyrimidine reductase 
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51 52 S 51 40 



Fig. 9. Mechanistic proposal for 3,4-dihydroxy-2-butanone-4-phosphate synthase 



surrounding a core consisting of the three a-subunits. The a-subunits catalyze 
the synthesis of riboflavin from lumazine. 

A mechanistic proposal for lumazine synthase, consistent with the crystal 
structure and with the observation that the methyl group of 40 ends up as the 
C-6 methyl group of lumazine, is outlined in Fig. 10 [31, 33]. Butanedione is not 
an intermediate and none of the proposed intermediates have yet been trapped. 



3.3.5 

Riboflavin Synthase 

Riboflavin synthase catalyzes the disproportionation of two molecules of 
lumazine to give riboflavin and 39 [34]. This complex reaction also occurs 
non-enzymatically [35-37]. The mechanism has not yet been fully established. 
A proposal, consistent with the regiochemistry of the reaction [31, 38-40], with 
the observation of facile H/D exchange at the C-7 methyl group of lumazine [38, 
40-42], with the facile nucleophilic addition to C-7 of lumazine [36,43,44], and 
with NMR studies [45-47], is outlined in Fig. 11 [39-40]. 




Fig. 10 . Mechanistic proposal for 6,7-Dimethyl-8-ribityl lumazine synthase 
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R = Ribityl side chain 

Fig. 11 . Mechanistic proposal for riboflavin synthase 



4 

Folate Biosynthesis 

Folate (tetrahydropteroylpolyglutamate) functions as the Ci-transfer agent in a 
variety of important metabolic processes [48]. 



4.1 

Biosynthetic Pathway 

The folate biosynthetic pathway is shown in Fig. 12 [49]. Loss of C-8 from 
guanosine triphosphate followed by rearrangement gives 66. Dephosphoryla- 
tion, cleavage of the side chain, and pyrophosphorylation gives 70. Displacement 
of the pyrophosphate by p-aminobenzoic acid (formed from chorismic acid by 
amination and loss of pyruvate- see Fig. 13) followed by condensation with 
glutamate and reduction gives tetrahydrofolate 73. Polyglutamylation at the 
y-carboxy group gives the biochemically active form of the cofactor. 
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Fig. 12. The folate biosynthetic pathway 
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Fig. 13 . The p-aminobenzoic acid biosynthetic pathway 




4.2 

Enzyme Overexpression and Purification 

The current status of the overexpression and purification of the folate bio- 
synthetic enzymes is summarized in the table. 



Gene 


Sequenced? 


Overexpressed? 


Purified? 


Other source [4] 


Reference 


folE 


yes 


yes 


yes 




50-52 


folX 


no 


no 


partial 


no 


53 


folY 


no 


no 


partial 


no 


53 


folZ 


no 


no 


partial 


yes 


54,55 


folK 


yes 


yes 


yes 




56 


folP 


yes 


yes 


yes 




57 


folC 


yes 


yes 


yes 




58, 59 


folA 


yes 


yes 


yes 




60,61 


pabA 


yes 


yes 


yes 




62. 63 


pabB 


yes 


yes 


yes 




62. 63 


pabC 


yes 


yes 


yes 




64, 65 



4.3 

Mechanistic Highlights 

4.3. 1 GTP Cydohydrolase I 

A mechanistic proposal, supported by the crystal structure of the enzyme with 
bound deoxyguanosine triphosphate, is outlined in Fig. 14 [66, 67]. 



4.3.2 

Dihydroneopterin Aldolase 

A mechanistic proposal is outlined in Fig. 15. No mechanistic studies have been 
carried out on this enzyme. 



4.3.3 

DIhydropteroate Synthase 

This enzyme is the target of the sulfonamide antibiotics [68]. The mechanism 
has not been determined. S^l or 5^2 mechanisms are possible. 
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OPPP 



66 



Fig. 14. Mechanistic proposal for GTP cyclohydrolase I 




Fig. 15. Mechanistic proposal for dihydroneopterin aldolase 



4.3.4 

ADC Synthase 

A mechanistic proposal is outlined in Fig. 16. It is not known whether the car- 
binolamine 82 or free ammonia reacts with chorismate [69], or whether the 
addition of the amine involves the initial addition of an active site nucleophile 
to chorismate [70-74]. 



4.3.5 

ADC Lyase 

A mechanistic proposal for this pyridoxal-dependent enzyme is outlined in 
Fig. 17 [64]. No mechanistic studies have been reported. 
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Fig. 16 . Mechanistic proposal for ADC synthase 




Fig. 17 . Mechanistic proposal for ADC lyase 
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Fig. 18 . The mechanism of dihydrofolate reductase 



4.3.6 

Dihydrofolate Reductase 

The mechanism of this intensively- studied enzyme is shown in Fig. 18 [48, 75, 
76, 77]. 

5 

Molybdopterin Biosynthesis 

Molybdopterin is the metal-binding cofactor in the molybdenum- and tungsten 
dependent oxidoreductases [78,79]. 



5.1 

Biosynthetic Pathway 

The biosynthesis of molybdopterin is outlined in Fig. 19. The initial step in- 
volves rearrangement of guanosine triphosphate to precursor Z. Sulfur transfer 
followed by metallation and guanylation gives the cofactor. 



5.2 

Enzyme Overexpression and Purification 

The current status of the overexpression and purification of the molybdopterin 
biosynthetic enzymes is summarized in the table. 



Gene 


Sequenced? 


Overexpressed? 


Purified? 


Other source? [4] 


Reference 


moaABC 


yes 


no 


no 


no 


80 


moaDE 


yes 


no 


yes 


no 


80,81 


moeB 


yes 


yes 


yes 


no 


80. 82 


mobAB 


yes 


no 


no 


no 


80 


modABCD 


yes 


no 


no 


no 


80 


mogA 


yes 


no 


no 


no 


80. 83 
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Fig. 19 . The molybdopterin biosynthetic pathway 




MobAB 

ModABCD 

MogA 



5.3 

Mechanistic Highlights 



5.3.1 

Precursor Z Synthase 

Mechanistic studies on the formation of the molybdopterin cofactor are still at 
an early stage. Conversion of guanosine, presumably as the triphosphate, to 
precursor Z occurs with retention of C-8 [84]. A possible mechanism for this 
third type of cyclohydrolase, which is consistent with the labeling experiment, is 
outlined in Fig. 20. (The other two types of cyclohydrolase are cyclohydrolase I 
for folate biosynthesis and cyclohydrolase II for riboflavin biosynthesis. In both 
cases, C8 is removed as formate.) 



5.3.2 

Molybdopterin Synthase 

The mechanism for the conversion of precursor Z to molybdopterin is not 
known. Sulfur, from an undetermined source, is transferred from MoaD to pre- 
cursor Z [81, 85]. 

A mechanistic proposal for the sulfur transfer, tentatively based on the 
sequence similarity between MoeB and the ubiquitin activating enzyme is out- 
lined in Fig. 21 [80]. MoeB has high sequence similarity to ThiF [86], suggesting 
that similar sulfur transfer chemistry may occur during the biosynthesis of the 
thiazole moiety of thiamin (see section 6 of this review). 





112 



T.P. Begley et al. 






Fig. 20. Mechanistic proposal for precursor Z synthase 




Fig. 21. Mechanistic proposal for molybdopterin synthase 
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Fig. 21 (continued) 



6 

Thiamin Biosynthesis 

Thiamin is involved in the stabilization of the acyl carbanion intermediate and 
is particularly important in carbohydrate metabolism [87]. 



6.1 

Biosynthetic Pathway 

The thiamin biosynthetic pathway is outlined in Fig. 22 [88]. Overall the path- 
way involves the separate synthesis of the thiazole 111 and the pyrimidine 114 
which are then coupled. l-Deoxy-D-xylulose-5-phosphate (see sections 11 and 
12 of this review) is the precursor to the five carbon unit of the thiazole [89], 
cysteine is the sulfur source [90, 91], and the C2-N3 atoms of the thiazole are 
derived from the a-carbon and the amino group of tyrosine [92-94]. The 
pyrimidine is derived from 5-aminoimidazole riboside (AIRs), an intermediate 
on the purine biosynthetic pathway. This reaction involves a complex rearrange- 
ment in which the C4' carbon of AIRs is inserted into the C4-C5 imidazole 
double bond, converting the imidazole to a pyrimidine, and the C2' carbon of 
AIRs is used to methylate the C2 position of the imidazole [95-98]. 



6.2 

Enzyme Overexpression and Purification 

The current status of the overexpression and purification of the thiamin bio- 
synthetic enzymes is summarized in the table. 



Gene 


Sequenced? 


Overexpressed? 


Purified? Other source [4] 


Reference 


thiC 


yes 


yes 


yes 


86. 99,100 


thiE 


yes 


yes 


yes 


86,101,102 


thiF 


yes 


yes 


yes 


86, 103 


thiGi 


yes 


yes 


yes 


86, 103 


thiG2 


yes 


yes 


yes 


86. 103 
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6.3 

Mechanistic Highlights 
6.3.1 

Thiazole Formation 

Cell free biosynthesis of the thiazole has not yet been established. A mechanistic 
proposal based on the sequence similarity between ThiF, MoeB, and the 
ubiquitin activating enzyme and on the occurrence of a putative zinc ion- 
binding motif in ThiF [80] is outlined in Fig. 23 [114]. 




COOH 

124 126 127 



Fig. 23. Mechanistic proposal for the thiazole formation 
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Fig. 24. Mechanistic proposal for the pyrimidine formation 



6.3.2 

Pyrimidine Formation 

Cell free biosynthesis of the pyrimidine has not yet been established. A mecha- 
nistic proposal, which is consistent with the labeling studies, is outlined in Fig. 24. 

7 

Biotin Biosynthesis 

Biotin is the cofactor involved in the catalysis of bicarbonate-dependent car- 
boxylation reactions. 
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7.1 

Biosynthetic Pathway 

The biotin biosynthetic pathway is outlined in Fig. 25 [115]. Pimeloyl-CoA, 
synthesized by a variation of the fatty acid biosynthesis pathway, is condensed 
with alanine to give 142. 

A transamination reaction, using S-adenosylmethionine as the amine donor, 
gives 143. Urea formation, followed by sulfur insertion into the C-IH and the 
C-4H bonds gives biotin. 



7.2 

Enzyme Overexpression and Purification 

The current status of the overexpression and purification of the biotin bio- 
synthetic enzymes is summarized in the table [115, 116]. 



Gene 


Sequenced? 


Overexpressed? 


Purified? 


Other source [4] 


Reference 


bioC 


yes 


no 


no 


no 


117 


bioH 


yes 


no 


no 


no 


117 


bioF 


yes 


yes 


yes 


yes 


118, 119 


bioA 


yes 


no 


yes 




120 


bioD 


yes 


yes 


yes 




121 


bioB 


yes 


yes 


yes 




122 
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Fig. 25. The biotin biosynthetic pathway 
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7.3 

Mechanistic Highlights 



7.3.1 

Pimeloyl CoA Synthase 

Labeling studies suggest that pimeloyl CoA is synthesized by a modified fatty 
acid pathway (Fig. 26) [117, 123, 124]. Cell free enzymatic activity has not 
been reported. Pimeloyl CoA is also synthesized from pimelic acid in Bacillus 
sphaericus [125]. 



7.3.2 

KAPA Synthase 

The mechanistic proposal for this pyridoxal phosphate-dependent enzyme is 
outlined in Fig. 27 and is supported by isotope exchange experiments [118]. 



7.3.3 

DAPA Synthase 

This PLP-dependent enzyme is unique in that it requires SAM rather than gluta- 
mic acid as the amino group donor. 

Although mechanistic studies have not been carried out, the reaction is likely 
to follow the standard transamination mechanism (Fig. 28) [126, 127]. 
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Fig. 26. The pimeloyl-CoA biosynthetic pathway 
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Fig. 27. Mechanistic proposal for KAPA synthase 



7.3.4 

Dethiobiotin Synthetase 

The mechanistic proposal for this reaction is outlined in Fig. 29 and is sup- 
ported by the demonstration that 162 is an intermediate [128] and by the crystal 
structure [129, 130]. 
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Fig. 28. Mechanistic proposal for DAPA synthase 



7.3.5 

Biotin Synthase 

This very interesting reaction involves the formal insertion of a sulfur atom into 
two unactivated CH bonds. The purified protein contains a [2Fe-2S] cluster 
[131]. A defined system, consisting of biotin synthase, flavodoxin, flavodoxin 
reductase, fructose 1,6-bisphosphate, cysteine, DTT, NADPH, ferrous ion, and 
SAM, capable of catalyzing the conversion of dethiobiotin to biotin has been 
characterized [132-134]. This system is still incomplete and gives a maximum of 
two moles of biotin per mole of biotin synthase. 
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Fig. 29. Mechanistic proposal for dethiobiotin synthetase 



The sulfur insertion occurs with scrambling of stereochemistry at C-1 and 
with retention at C-4 [135, 136]. The protons at C-2 and at C-3 are not lost during 
the reaction. 1-Mercaptodethiobiotin maybe an intermediate [137, 138]. In the 
reconstituted system, sulfur from cysteine or SAM was not incorporated into 
biotin [132] suggesting that the iron sulfur cluster is the sulfur source. 

A mechanistic hypothesis is outlined in Fig. 30. Hydrogen atom abstraction 
from C-1 by an adenosyl radical (or by a protein radical formed from the adeno- 



Fe^s^Fe, 

164 



HjC 



HN 

h4 — 

)-R 



Ad 



144 



NH 



O 

A 

HN I 

"ft" 

170 



Ad- 



165 




COOH 

167 



S— CH 3 



o 

A 

HN NH 
H») — ^H 
Hjq )-R 

Ad> H 

168 

O 

A 

HN • 



NH 

"tt" 



'S' 

2 



CH, 

Ad 



. 1I,S.IIC 
Fe.^,Fo.^ 

166 



CH2 

Ad 



HN 



O 

A 



NH 

Hi4 — ^H 



Y 

169 



CH2 

Ad 



X-H 



COO- 



Fig. 30. Mechanistic proposal for biotin synthase 
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syl radical) gives 168. Radical substitution on the undefined sulfur source (X-S-Y) 
would give 169. A second radical substitution reaction would complete the 
formation of biotin. The requirement for SAM in the reconstituted biotin 
synthase reaction mixture suggests the intermediacy of the adenosyl radical 
[132, 139]. Reductive cleavage of sulfonium salts [140-142] and radical sub- 
stitutions at sulfur are precedented [143]. 

8 

Lipoic Acid Biosynthesis 

Lipoic acid is a cofactor involved in acyl group transfer and 2-electron redox 
reactions. 



8.1 

Biosynthetic Pathway 

The biosynthesis of lipoic acid from octanoic acid is outlined in Fig. 31 [115] 
and involves the insertion of sulfur into two unactivated CH bonds [144]. 

8.2 

Enzyme Overexpression and Purification 

The current status of the overexpression and purification of lipoate synthase is 
summarized in the table. 



Gene 


Sequenced? 


Overexpressed? 


Purified? Other source [4] 


Reference 


lipA 


yes 


yes 


partial no 


145, 146 



8.3 

Mechanistic Highlights 



8.3.1 

Lipoate Synthase 

This reaction is analogous to the sulfur insertion reaction involved in biotin bio- 
synthesis, and LipA and BioB show significant sequence similarity [145, 146]. 
8-Mercapto-octanoic acid and 6-mercapto-octanoic acid are intermediates 
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Fig. 31. The lipoic acid biosynthetic pathway 
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[147, 148]. The sulfur insertion into the C-6H bond occurs with inversion of 
stereochemistry [149]. Cysteine is the sulfur source [150]. Lipoic acid synthesis 
in a defined cell free system has not yet been accomplished. 

9 

Pantothenic Acid and Coenzyme A Biosynthesis 

Coenzyme A and pantothenic acid function as acyl group carriers and play a key 
role in biochemical Claisen condensation reactions. 



9.1 

Biosynthetic Pathway 

The biosynthesis of panthothenate and coenzyme A is outlined in Fig. 32 [151]. 
Hydroxymethylation of 172 followed by reduction and condensation with 
jS-alanine (formed by the decarboxylation of aspartate) gives pantothenic acid. 
Phosphorylation of panthothenate, followed by condensation with cysteine and 
decarboxylation gives 179. Adenylation, followed by a final phosphorylation at 
the ribose 3'-hydroxyl, completes the biosynthesis. 



9.2 

Enzyme Overexpression and Purification 

The current status of the overexpression and purification of the panthothenate 
and coenzyme A biosynthetic enzymes is summarized in the table. 



Gene 


Sequenced? 


Overexpressed? 


Purified? 


Other source [4] 


Reference 


panB 


yes 


yes 


yes 




152 


panE 


no 


no 


partial 


yes 


153,154 


panD 


no 


no 


yes 




155 


panC 


no 


no 


yes 




156 


coaA 


yes 


yes 


yes 




157 


panW 


no 


no 


no 


yes 


158 


panX 


no 


no 


yes 




159 


panY 


no 


no 


no 


yes 


160,161 


panZ 


no 


no 


no 


yes 


161 



9.3 

Mechanistic Highlights 



9.3.1 

Ketopantoate Hydroxymethyltransferase 

The purified enzyme requires a methylene tetrahydrofolate cofactor [162]. The 
hydroxymethylation reaction proceeds with inversion [163] (Fig. 33) and pro- 
bably occurs by a mechanism analogous to serine hydroxymethyl transferase [48] . 
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Fig. 32. The pantothenic acid and coenzyme A biosynthetic pathway 



9.3.2 

Aspartate Decarboxylase 

This enzyme contains a pyruvamide cofactor. No mechanistic studies have been 
reported. A proposal, analogous to the mechanism of histidine decarboxylase, is 
outlined in Fig. 34 [164]. 
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Fig. 33. Mechanistic proposal for ketopantoate hydroxymethyltransferase 
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Fig. 34. Mechanistic proposal for aspartate decarboxylase 



9.3.3 

4-Phosphopantothenoylcysteine Decarboxylase 

This interesting enzyme contains a pyruvoyl cofactor [159, 164, 165]. The 
mechanism is unknown. Decarboxylation proceeds with retention of stereo- 
chemistry [166]. Two proposals are outlined in Fig. 35. 
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Fig. 35. Mechanistic proposal for 4-phosphopantothenoylcysteine decarboxylase 



10 

S-Adenosylmethionine Biosynthesis 

S-adenosylmethionine (SAM) is the cofactor involved in most biochemical 
methylation reactions. 

10.1 

Biosynthetic Pathway 

The biosynthesis of SAM involves the alkylation of methionine by ATP and is 
outlined in Fig. 36. 
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SAM Synthetase 
(MetK) 

^ H nH, 
^S-"'Os 

COOH 

167 




COOH 



HO OH 



Fig. 36. The S-adenosylmethionine biosynthetic pathway 



10.2 

Enzyme Overexpression and Purfication 

The current status of the overexpression and purification of SAM synthetase is 
summarized in the table. 



Gene 


Sequenced? 


Overexpressed? 


Purified? Other source [4] 


Reference 


metK 


yes 


yes 


yes 


167 



10.3 

Mechanistic Highlights 



10.3.1 

SAM Synthetase 

Stereochemical studies [168] andisotope effects [169] are consistent with an Sn 2 
mechanism. The crystal structure has been determined [170]. 

11 

Pyridoxal Phosphate Biosynthesis 

Pyridoxal phosphate is the cofactor involved in the stabilization of carbanions at 
the a and (i positions of amino acids [126]. 

11.1 

Biosynthetic Pathway 

The pyridoxal biosynthetic pathway is outlined in Fig. 37. Oxidation of 196 
followed by transamination gives 4-hydroxy- L-threonine-4-phosphate 199 [171]. 
Condensation with l-deoxy-o-xylulose (see also ubiquinone and thiamin 
sections) and a final oxidation gives the cofactor [172- 175]. 

11.2 

Enzyme Overexpression and Purification 

The current status of the overexpression and purification of the pyridoxal 
phosphate biosynthetic enzymes is summarized in the table [176]. 








128 



T.P. Begley et al. 



Gene 


Sequenced? 


Overexpressed? 


Purified? 


Other source [4] 


Reference 


epd 


yes 


yes 


yes 




177 


pdxB 


yes 


yes 


no 


no 


178 


serC 


yes 


yes 


no 


no 


178 


pdxAJ 


yes 


no 


no 


no 


176,179 


pdxH 


yes 


yes 


yes 




180 


pdxK 


yes 


no 


yes 




181 



HiV^OP 



D-Erythrose-4-Phosphate 

Dehydrogenase 



^CHO 

D-Erythrose-4-Phosphate 

196 



(Epd) 



H^^OP 

0*^CCK)H 

D-2-Oxoerythronic 

Acid-4-Phosphate 

198 




3-Phosphoserine 
Transaminase . 



^ OP 

j^COOH 

4-Phosphoerythronate 

197 



hA-^op 



4-Phosphoerythronate 

Dehydrogenase 

(PdxB) 




(SerC) 



Vrir 



■'CCXIH 
4-Hydroxy-L-Threonine- 
4-Phosphate 

199 



Pyridoxol Phosphate 
Synthase (PdxAJ) 



Pyridoxol-5'-phosphate Oxidase 
(PdxH) 




Pyridoxol-5'-Phosphate 

201 



Pyridoxol kinase (PdxK) 



Pyridoxol 

Fig. 37. The pyridoxal phosphate biosynthetic pathway 



Pyridoxal-5'-Phosphate 

3 

Pyridoxal kinase (PdxK) 

Pyridoxal 



11.3 

Mechanistic Highlights 



11.3.1 

Pyridoxol phosphate synthase 

The pyridoxol phosphate synthase activity has not yet been reconstituted in a 
cell free system. A mechanistic proposal is outlined in Fig. 38 [176]. 
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Fig. 38. Mechanistic proposal for pyridoxal phosphate synthase 



12 

Ubiquinone Biosynthesis 

Ubiquinone mediates electron transfer between dehydrogenases and the cyto- 
chromes. 

12.1 

Biosynthetic Pathway 

The biosynthesis of ubiquinone is outlined in Fig. 39 [182]. Elimination of 
pyruvate from chorismate followed by prenylation and decarboxylation gives 
2-octaprenyl phenol. A sequence of three hydroxylation/methylation reactions 
completes the biosynthesis. 

The isopentenyl pyrophosphate and the dimethylallyl pyrophosphate 
precursors to the octaprenyl moiety are derived from 1-deoxy-o-xylulose- 
5-phosphate rather than from mevalonic acid (see also thiamin and pyridoxal 
sections) [183, 184]. 

12.2 

Enzyme Overexpression and Purification 

The current status of the overexpression and purification of the ubiquinone bio- 
synthetic enzymes is summarized in the table. 
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Gene 


Sequenced? 


Overexpressed? 


Purified? 


Other source [4] 


Reference 


ubij 


no 


no 


no 


no 


183, 185 


ubiK 


no 


no 


no 


no 


183, 185 


ubiC 


yes 


yes 


yes 




186-188 


ubiA 


yes 


yes 


no 


no 


189-192 


ubiD 


yes 


no 


no 


no 


193 


ubiX 


yes 


no 


no 


no 


182 


ubiB 


yes 


no 


no 


no 


194 


ubiG 


yes 


no 


partial 


no 


194-196 


ubiH 


no 


no 


no 


no 


194, 182 


ubiE 


yes 


no 


no 


no 


182, 194, 197 


ubiF 


no 


no 


no 


no 


194, 182 
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12.3 

Mechanistic Highlights 



12.3.1 

4-Hydroxybenzoate Octaprenyl Transferase 

The mechanism for this reaction has not been studied. A simple electro- 
philic substitution as outlined in Fig. 40 is likely. The reaction may involve an 
Sn 2 attack by the phenol on the pyrophosphate or proceed via a carboca- 
tion. 



12.3.2 

OHB Decarboxylase 

The mechanism of this reaction has not been studied. A proposal is outlined in 
Fig. 41. 



12.3.3 

2-Octaprenylphenol Monooxygenase 

There are three hydroxylation reactions on the ubiquinone biosynthetic 
pathway. All three oxygens are derived from molecular oxygen [198] and heme 
is not required [199]. This suggests that these hydroxylations proceed by a 
mechanism analogous to that of the flavin dependent tyrosine hydroxylase 
(Fig. 42) [14], 




211 210 219 212 

R=Heptaprenyl Side Chain 

Fig. 40. Mechanistic proposal for 4-hydroxybenzoate octaprenyl transferase 




R=Heptaprenyl Side Chain 



Fig. 41. Mechanistic proposal for OHB decarboxylase 
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H H 




PO OH 

Fig. 42. Mechanistic proposal for 2-octaprenylphenol monooxygenase 




228 229 230 

Fig. 43. Mechanistic proposal for the formation of isopentenyl pyrophosphate 





Cofactor Biosynthesis: A Mechanistic Perspective 



133 





NADPH 



OPP 






OPP 



R= 



235 



208 



Fig. 43 (continued) 





12.3.4 

Formation of Isopentenyl Pyrophosphate 

A proposal for the formation of l-deoxy-D-xylulose-5-phosphate and its con- 
version to isopentenyl pyrophosphate is outlined in Fig. 43 [183, 185]. None of 
the enzymes involved have yet been cloned or characterized. 

13 

Menaquinone Biosynthesis 

Menaquinone (vitamin K 2 ) mediates electron transfer between dehydrogen- 
ases and the cytochromes and is the major electron carrier cofactor during 
anaerobic growth of E. coli. Vitamin K is also an essential cofactor in the 
post-translational carboxylation of glutamic acid residues in several protein 
systems [200]. 

13.1 

Biosynthetic Pathway 

The biosynthesis of menaquinone is outlined in Fig. 44. Isomerization of 
chorismate to isochorismate followed by condensation with a-ketogluta- 
rate and aromatization gives o-succinylbenzoic acid. Conversion of 238 
to the Co A thio ester, followed by cyclization, prenylation and methyla- 
tion completes the biosynthesis. The biosynthesis of the prenyl side chain 
follows the alternative terpene biosynthetic pathway described for ubiqui- 
none. 

13.2 

Enzyme Overexpression and Purification 

The current status of the overexpression and purification of the menaquinone 
biosynthetic enzymes is summarized in the table [182]. 
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Gene 


Sequenced? 


Overexpressed? 


Purified? 


Other source [4] 


Reference 


^9 


yes 


no 


no 


no 


201,202 




yes 


yes 


partial 


no 


203-206 




yes 


no 


partial 


no 


205-207 


menE 


yes 


yes 


yes 




208-212 


menB 


yes 


no 


no 


no 


212,213 


menA 


yes 


no 


no 


no 


214,215 


menG 


yes 


no 


no 


no 


197,212,216 



-OOCv^ 



Q COO ■ Isochorismate Synthase 

(MenF) 






HO’ 
Chorismate 

74 









COO- 



Isochorismate 

236 



SHCHC Synthase 
(MenD) 

a-Ketoglutarate 




O 

2-Succinyl-6-Hydroxy-2,4- 
Cyclohexadiene- 1 -Carboxylate 
(SHCHC) 

237 




OSB-CoA Synthetase 
(MenE) 

CoA-SH ” 
ATP 



o-Sucdnylbenzoic Acid 
(OSB) 

238 



--^ss^COO- 



DHNA Synthase 
(MenB) 



COSCoA 




coo- 



DHNA Prenyltransferase 
(MenA) 



o-Succinylbenzoyl-CoA 

(OSB-CoA) 

239 



l,4-Dihydroxy-2-Naphthoic Acid 
(DHNA) 

240 





Fig. 44. The menaquinone biosynthetic pathway 
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13.3 

Mechanistic Highlights 

13.3.1 

Isochorismate Synthase 

The mechanism for this enzyme has not yet been determined. Three mecha- 
nisms have been proposed and are outlined in Fig. 45 [70]. 

13.3.2 

SHCHC Synthase 

The mechanism of this reaction has not been studied. A proposal is outlined in 
Fig. 46 [204]. 



13.3.3 

DHNA Synthase 

The stereochemistry of this reaction has been studied. The pro-R proton is initi- 
ally removed from C-2 and the pro-S proton is removed from C-3 [217]. A 
mechanistic proposal is outlined in Fig. 47. 



13.3.4 

DHNA Prenyl Transferase 

The mechanism for this reaction has not been studied. A simple electrophilic 
substitution as outlined in Fig. 48 is likely. The reaction may involve an 5^2 
attack by the phenol on the pyrophosphate or proceed via a carbocation. 




74 236 




Fig. 45. Mechanistic proposals for isochorismate synthase 



243 
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Fig. 46. Mechanistic proposal for SHCHC synthase 




Fig. 47. Mechanistic proposal for DHNA synthase 




CH 10 



Fig. 48. Mechanistic proposal for DHNA prenyl transferase 
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14 

Summary and Conclusions 

After the cofactor biosynthetic pathways had been determined, interest in co- 
factor biosynthesis declined because the enzymes involved could not be isolated 
in quantities suitable for mechanistic studies. This is clearly no longer the case. 
Most of the genes involved in the biosynthesis of the E. coli cofactors have been 
cloned and sequenced but the overexpression and the chemistry lags far behind 
the molecular biology. There are now many interesting tractable problems in this 
area awaiting solution by chemists interested in solving mechanistic puzzles. 

15 
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Vitamin B12 (cyanocobalamin) is the normal isolated form of coenzyme B12 (adenosylcobal- 
amin), a structure of marvellous architecture and amazing biological activity. It belongs to 
the family of tetrapyrroles which includes inter alia the haems and the chlorophylls. This 
review begins with a brief overview of the biosynthesis of tetrapyrroles in general but then 
concentrates on recent research on Bj2 biosynthesis. The first main section reviews the bio- 
synthesis of uro’gen III, the last common precursor of all natural tetrapyrroles, concentrating 
particularly on the three enzymes, porphobilinogen synthase, hydroxymethylbilane synthase 
and uro’gen III synthase. Crystal structures are available for the second of these enzymes and 
a new proposal is presented for its detailed mode of action. The second main section reviews 
the recent discovery of the complete biosynthetic pathway from uro’gen III to hydrogen- 
obyrinic acid in Pseudomonas denitrificans, which has revealed some beautiful and totally 
unexpected chemistry. A short section then describes the many similarities and some 
differences in the chemistry used by the micro-aerophilic organism Propionibacterium 
shermanii for the synthesis of coenzyme Bj2 compared with that seen in the aerobic Ps. 
denitrificans. Finally an account is given of the remarkable steps needed to complete the 
synthesis of the coenzyme in both organisms. 

Keywords: Vitamin B12; Tetrapyrrole; Porphyrin; Coenzyme B 12; Corrin. 
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List of Symbols and Abbreviations 

ALA 5-aminolaevulinic acid 

copro’gen coproporphyrinogen 

HMBS hydroxymethylbilane synthase 

Kb kilobase-pairs (of DNA) 

NADH reduced nicotinamide adenine dinucleotide 

NADPH reduced nicotinamide adenine dinucleotide phosphate 

NMR nuclear magnetic resonance 

PBG porphobilinogen 

PLP pyridoxal phosphate 

PMP pyridoxamine phosphate 

proto’gen protoporphyrinogen 

SAM S-adenosyl-L-methionine 

uro’gen uroporphyrinogen 

UROS uroporphyrinogen III synthase 

1 

Overview of Tetrapyrrole Biosynthesis 

The tetrapyrroles are a group of natural products which include the haems (e.g. 
haem b 1), the chlorophylls (e.g. chlorophyll a 2) and the corrinoids (e.g. co- 
enzyme Bi 2 4), see Scheme 1 [1 -6]. In addition to these well-known and wide- 
spread enzymic cofactors, other tetrapyrroles are used in more restricted cases, 
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Scheme 1. Some of the more important tetrapyrroles 



including bilins (e.g. phycocyanin 6 ) used for light-harvesting in algae, siro- 
haem 7 in sulphite reductase, haem dj 8 in the nitrite reductase/cytochrome 
oxidase of denitrifying bacteria, and coenzyme F 430 9 in methyl coenzyme M 
reductase, the final enzyme in methane production. These compounds are all 
intensely coloured and every living organism has an absolute requirement for 
one or more of them. For this reason they have been christened the “pigments 
of life”. 
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All the natural tetrapyrroles have their four pyrrolic rings, at various oxida- 
tion levels, separated by single carbon atom bridges, except in the corrinoids 
(e.g. 3) where two of the rings are directly linked. The substituents on the 
pyrrolic rings divide into two classes: in 4, 7 and 9 we find, for the most part, 
acetate and propionate side-chains with a variable number of additional methyl 
groups, whereas in 1,2 and 6 most of the acetate and propionate side-chains have 
been decarboxylated to give methyl and ethyl (or vinyl) groups. A common 
feature among all the compounds, however, is that the substituents occur in the 
same order on three of the pyrrolic rings but are reversed on the fourth one, 
which is always drawn at the lower left-hand side and is called ring D in all but 
the bilins. 

The common features shared by all the tetrapyrroles are a direct consequen- 
ce of the fact that they are all derived from a single common tetrapyrrolic 
macrocycle, uroporphyrinogen III 10, abbreviated to uro’gen III, Scheme 2. It is 
at the stage of uro’gen III that there is a major branching of the biosynthetic 
pathways. One branch, found in higher organisms, starts with decarboxylation 
of uro’gen III and then involves two oxidative steps to give protoporphyrin IX 
11. Here a further branching of the pathway occurs, leading eventually to the 
haems, chlorophylls and bilins. This branch of tetrapyrrole biosynthesis is 
outlined in Scheme 2 but will not be covered further here. The interested reader 
is referred to a number of reviews which cover this biosynthetic pathway in 
some in depth [1-4]. The pathway that we will be following begins with two 
methylations to give precorrin -2 12 and leads eventually to corrinoids, with off- 
shoots leading to sirohaem [ 6 - 8 ], haem dj [9, 10] and coenzyme F 430 [11] Scheme 3. 
This pathway is found only in bacteria, especially anaerobic bacteria, and is 
believed to be the more primitive one from an evolutionary point of view. 




Protoporphyrin IX 11 



Scheme 2. The start of the pathway from uro’gen III to haems, bilins and chlorophylls 
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Precorrin-2 12 

Scheme 3. The start of the B 12 pathway from uro’gen III 



Vitamin B 12 (cyanocobalamin) 3 is, in fact, not a natural product as the 
cyanide ligand to the cobalt ion is added during the isolation procedure. 
Coenzyme Bj 2 (adenosylcobalamin) 4 and methylcobalamin 5 are the true final 
products of the biosynthetic pathway. Co enzyme B 12 is the cofactor for a number 
of enzymic rearrangement reactions, such as that catalysed by methylmalonyl 
CoA mutase, and methylcobalamin is the cofactor for certain methyl transfer 
reactions, including the synthesis of methionine. A number of anaerobic bac- 
teria produce related corrinoids in which the dimethylhenzimidazole moiety of 
the cobalamins (3-5) is replaced by other groups which may or may not act as 
ligands to the cobalt ion, such as adenine or p-cresol [12]. 

2 

Biosynthesis of Uroporphyrinogen III 



2.1 

Introduction 

Early experiments on tetrapyrrole biosynthesis in the 1940s and 1950s had 
established that the pathway shown in Scheme 4 is followed in animals [ 1 - 4] . 
Thus glycine 13 and succinyl CoA 14 are condensed to give 5-aminolaevulinic 
acid 15 (ALA). Two molecules of ALA are combined to give porphobilinogen 16 
(PBG) and then four molecules of PBG are combined to form uro’gen III 10. Sub- 
sequent research has demonstrated that ALA, PBG and uro’gen III are inter- 
mediates of tetrapyrrole biosynthesis in all organisms but it has transpired that 
plants, algae and most bacteria use an alternative route for the production of 
ALA starting from glutamate. 

Once the intermediates had been identified, research concentrated mostly on 
the individual enzymes of the pathway. The current state of knowledge on each 
of the enzymes is reviewed in the following sections. 



2.2 

5-Aminolaevulinic Acid Synthesis 

In animals, yeasts and purple photo synthetic bacteria ALA 15 is made from 
glycine 13 and succinyl CoA 14 by the action of a single enzyme, ALA synthase. 
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HO 5 C 



HOgC. 



HO2C 



H,N 



S 





CO2H 



COSCoA 

Glycine 13 Succinyl CoA 14 ' ALA IS ' PBG1B 

Scheme 4. Outline of the biosynthetic pathway to uro’gen III 



uro'gen III 10 



This enzyme has been isolated from many organisms and in all cases studied the 
enzyme is dependent on pyridoxal phosphate (PLP). In isotopic labelling ex- 
periments it was found that all four carbon atoms of succinyl CoA 14 are in- 
corporated into ALA along with C-2 and the nitrogen atom of glycine. Further- 
more only one of the two hydrogen atoms at C-2 of glycine is incorporated, Hj^ 
being lost to the medium while Hj is incorporated into the pro-S position at 
C-5 of ALA [1,2]. From this information, the mechanism shown in Scheme 5 can 
be proposed. In order to end up with the observed stereochemistry, either the 
initial acylation of the glycine occurs with retention of configuration and the 
subsequent decarboxylation occurs with inversion or vice versa. 

From 1974 onwards it became increasingly apparent that in plants a different 
route to ALA must be involved [1, 2]. It was shown that incorporation of radio- 
activity from glycine and succinate into chlorophyll a 2 was poor compared with 
the incorporation of five-carbon compounds such as glutamate 17. labelling 
of glutamate showed that all five carbon atoms are incorporated and this route 
to ALA from glutamate is generally known as the Cj route as opposed to the 
Shemin route from glycine and succinyl CoA. Subsequently this C 5 route has 
been shown also to operate in algae and most bacteria, including all the major 
producers of coenzyme Bj 2 and related corrinoids. 

Glycino 13 

O 

CoA^-v^COjH 




o 



PLP 






Scheme S. Probable mechanism for ALA synthase 
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The individual steps involved in the conversion of glutamate into ALA are 
shown in Scheme 6 [1, 2, 13]. First the glutamate is converted by an ATP-depen- 
dent ligase into a *RNA ester 18, which appears to be the same as the gluta- 
myl-*RNA used for protein synthesis in the plant chloroplast. This gluta- 
myl-*RNA is then reduced by an NADPH-dependent reductase to glutamate 
1 -semialdehyde 19. As expected of an a-amino aldehyde, 19 is not particularly 
stable under neutral or basic conditions but can be isolated under acidic con- 
ditions, under which it cyclises to the corresponding lactol [2]. 

The final step is a double transamination reaction which isomerises gluta- 
mate 1- semialdehyde 19 to ALA. Logically, this could occur via either PLP- 
mediated loss of the amino group to give dione 20 followed by regaining of the 
same amino group at C-5 to give ALA or, alternatively, pyridoxamine phosphate 
(PMP) -mediated gain of an amino group could give diamine 21 and then loss of 
the amino group at C-4 would give ALA (Scheme 7). UV spectroscopy indicates 




Glutamate 17 Glutamyl-'RNA 18 . GMamate ALA 15 

1-semialdehyde 19 

Scheme 6. ALA biosynthesis in plants, algae and most bacteria 




22 23 24 



Scheme?. Possible mechanisms for glutamate 1-semialdehyde aminomutase 
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that the enzyme isolated from various sources has predominantly PMP bound 
with a minor amount of PLP [ 1 3 ] . Treatment with the diamine 2 1 converts the PLP 
form into the PMP form and stimulates the enzyme, whereas dione 20 increases 
the concentration of the PLP form. Given these observations, it seems likely that 
the preferred pathway is via diamine 21. This has been confirmed by a double 
labelling experiment in which [l-^^C]glutamate and [^®N] glutamate were mixed 
and converted into ALA by the enzymes from Chlamydomonas reinhardtii. NMR 
spectroscopy of the resulting ALA revealed that the two labels had come together 
in some molecules, indicating intermolecular transfer [14]. The reaction sequen- 
ce proceeding via diamine 21 requires such intermolecular transfer, whereas the 
alternative sequence via the dione 20 would more likely involve intramolecular 
transfer of the amino group. Finally, the apoenzyme can only be reactivated by 
PMP not PLP, again suggesting the pathway via diamine 21 is followed [15]. 

It is conceivable that instead of hydrolysis of the imine intermediate 22 to give 
diamine 21 and PLP followed by formation of the alternative imine 24, an intra- 
molecular transfer of the pyridoxyl moiety from one nitrogen to the other 
occurs via a five-membered ring intermediate 23. This appears not to be the 
case, however, as it would mean that free PLP is never formed and yet the enzy- 
me is known to be inactivated by a number of suicide inhibitors, such as 
gabaculine, which react specifically with PLP [1,2, 13]. 



2.3 

Porphobilinogen Synthase (5-Aminolaevulinic Acid Dehydratase) 

The next stage in the biosynthesis, dimerisation of ALA 15 to give PEG 16, is 
catalysed by PEG synthase, also known as ALA dehydratase. This transforma- 
tion has been most extensively investigated using enzymes from the photo- 
synthetic bacterium Rhodopseudomonas spheroides and from bovine liver [1, 2] 
and, more recently, with overexpressed enzymes from the bacteria Escherichia 
coli [16, 17] and Bacillus subtilis [18, 19]. In all cases, the formation of an imine 
between ALA and a lysine residue on the enzyme is indicated by inactivation 
caused by NaEH 4 . This still leaves an uncertainty about the mechanism, how- 
ever, because there must be binding sites for two molecules of ALA, one of which 
provides the acetate side-chain of PEG (referred to as the A-site) and the other 
of which provides the propionate side-chain (the P-site), and it is not known in 
which site imine formation occurs. The originally proposed mechanism in- 
volved an imine in the A-site, with the corresponding enamine being the nucleo- 
phile in the key C-C bond-forming step, as shown in Scheme 8a [20]. An alter- 
native mechanism (Scheme 8b) involves imine formation with the lysine residue 
in the P-site and a second imine linkage between the two molecules of ALA [21, 
22]. This was proposed after pulse-labelling experiments, using and ^Re- 
labelled ALA, had shown that the first molecule of ALA to bind ends up as the 
propionate-side of the product, PEG. However, the assumption was made, in pro- 
posing the alternative mechanism, that the first molecule to bind is the one that 
forms the imine. 

Deuterium isotope effects have been measured for the B. subtilis enzyme [19]: 
deuteration of ALA at C-5 had no effect on the rate but deuteration at C-3 
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PBG 16 27 

Scheme 8 a,b. Alternative mechanisms for ALA dehydratase 




reduced the rate by a factor of 3.3 and is 2.3-fold lower. When ALA 

was reisolated after 50% reaction, no loss of deuterium was detected. These 
results indicate that the first deprotonation at C-3 of ALA is a rate-determining 
step and furthermore that the steps leading up to this step must be reversible. 
The rate-determining deprotonation step would be 25 ^ 26 in the first mecha- 
nism (Scheme 8a) or 28 ^ 29 in the second mechanism (Scheme 8b). 

The stereochemistry of the final deprotonation, at the carbon which becomes 
C-2 of PBG, has been determined using [5S-^H]ALA (derived from [2-^H] glycine 
by the ALA synthase reaction, see Sect. 2.2) [1,2]. The tritium label is entirely 
retained in the PBG produced, whereas 50% is lost from ALA which is random- 
ly tritiated at C-5. Thus it is the pro-P hydrogen atom that is lost as illustrated in 
Scheme 8, 27 ^ 16. 

PBG synthase is a metal-requiring enzyme but the metals required vary from 
one source to another. The bovine enzyme requires zinc, the E. coli enzyme 
binds both zinc and magnesium [16, 17], whereas plant enzymes only require 
magnesium. In the bovine enzyme two zinc atoms bind per subunit. An EXAFS 
experiment has shown that the first zinc ion, required for activity, binds in a site 
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where it is five-coordinate, with two or three histidine nitrogen atoms, a sulphur 
from cysteine and one or two oxygen atom as ligands. The second ion binds in a 
site where it is coordinated by four cysteine residues [23]. It is not clear, however, 
whether the metal ions are directly involved in the catalytic mechanism; it is 
possible that they only serve to maintain the active conformation of the enzyme. 

Chlorolaevulinic acid 28 is a potent competitive inhibitor of bovine PEG syn- 
thase, presumably due to binding in the active site in place of ALA, and it also 
inactivates the enzyme by alkylation of a specific cysteine residue [24] 
(Scheme 9). The concentration required for the inactivation is much greater 
than that required for competitive inhibition, however, which suggests that the 
processes occur at different sites on the enzyme. Electrospray mass spectro- 
metry has shown that 28 can alkylate at multiple sites on the B. subtilis enzyme 
without causing more than about 50% loss of activity [18]. It is likely that there 
is no cysteine residue in the active site of this latter enzyme. 

The availability of mass spectrometric techniques (such as electrospray) for 
observing molecular ions from intact enzymes is an important advance in the 
study of the covalent chemistry of proteins. Although the use of radioactively 
labelled molecules has previously allowed the average level of protein modifi- 
cation to be deduced, mass spectrometry is much quicker, more convenient and 
safer; it also shows the distribution between the various multiply modified 
species and, most importantly, gives the molecular weight of the modified 
species, allowing deductions to be drawn as to the nature of the reaction occur- 
ring. 

An inhibitor which certainly does seem to act at the active site of PEG 
synthase is the 3-thia analogue 29 of ALA. Whereas attack by the amino group 
of the active site lysine residue on the keto group of ALA leads to an imine, attack 
on the equivalent thioester carbonyl group of 29 leads to irreversible acylation 
of the amine, as shown in Scheme 9 [18, 25]. NMR was used to demonstrate 
this formation of an amide linkage and mass spectrometry was again used to 
demonstrate that a single acylation is sufficient to inactivate the enzyme. Im- 
portantly, the rate of inactivation was proportional to the square of the concen- 
tration of thioester 29, indicating that two molecules bind before the attack of 
the amino group occurs, because of the close similarity of the inactivator and 
the substrate, it is probable that two molecules of the substrate also bind before 
imine formation occurs. 

It will be obvious from the above that there is still considerable uncertainty 
about the mechanism of PEG synthase. However there have been reports of 
crystallisation of the bovine [26] and yeast enzymes [27], and preliminary X-ray 
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Scheme 9. Inactivation of ALA dehydratase 
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diffraction data on the former have been obtained, so it is hoped the picture will 
become much clearer in the near future when the crystal structures have been 
solved. 



2.4 

Hydroxymethylbilane Synthase (Porphobilinogen Deaminase) 

It was recognised in the 1950s that the next two enzymes in the pathway catalyse 
the tetramerisation of PEG, with loss of ammonia, to give uro’gen III 10. Heat 
treatment inactivated one of the two components resulting in the formation of 
the unrearranged uro’gen 131 instead of uro’gen III, in which ring D is inverted 
(Scheme 10). The first enzyme became known as PEG deaminase and the second 
one was called cosynthetase because it was thought that it somehow interacts 
with PEG deaminase to modify the product that it forms. It was not until 1978, 
that it was discovered that in fact PEG deaminase produces hydroxymethyl- 
bilane 30, which is the true substrate for cosynthetase. In the absence of cosyn- 
thetase this intermediate undergoes a rapid non-enzymic transformation into 
uro’gen I, thus explaining the previous observations [1, 2, 28]. Following this 
discovery the names hydroxymethylbilane synthase (HMES) and uro’gen III 
synthase (UROS) were officially adopted for the two enzymes [29]. 

It was known from several different experiments that the successive PEG 
molecules become covalently linked to HMES during the course of the reaction. 
However, attempts to identify the enzymic group to which the first PEG mole- 
cule became attached were all unsuccessful until two important techniques were 
introduced: firstly the gene for HMES in E. coli {hemC) was cloned and over- 
expressed allowing the production of large quantities of the enzyme and 
secondly it was found that the enzyme-PEG complexes could be separated on a 
multimilligram scale by fast protein liquid chromatography (FPLC). These tech- 
niques made a range of experiments possible which could not be contemplated 
before. In the first of these experiments, HMES was incubated with a slight 
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Scheme 10. The relationship between hydroxymethylbilane synthase and uro’gen III synthase 
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excess of [11-^^C]PBG and the resulting mixture of complexes was separated to 
give approximately 10 mg of pure monoPBG complex. The NMR spectrum 
of this complex, at pH 12 to partially denature the protein, showed a strong new 
peak at <5 24.6 ppm [30, 31]. From extensive model studies, this chemical shift 
was known not to correspond to any of the expected types of attachment, i. e. 
pyrrole-GH2-0, pyrrole-CH 2 -N or pyrrole-CH 2 -S [32], but instead it corres- 
ponded exactly to pyrrole-GH 2 -pyrrole. It follows that there must already be an 
a-free pyrrole on the native enzyme to which the first PBG molecule becomes 
attached. 

The presence of an a-free pyrrole attached to the native HMBS was also 
demonstrated, both in Gambridge and Southampton, by treatment with 
Ehrlich’s reagent, acidic p-dimethylaminobenzaldehyde [30, 33]. This initially 
gave the UV/visible absorbance at 564 nm, typical of the Ehrlich product from 
an a-free pyrrole, but the spectrum then changed to one at 495 nm, typical of a 
dipyrromethene, indicating that the cofactor is in fact a dipyrromethane (e. g. 
32), as shown in Scheme 1 1 and tautomerisation of the initial product 33 occurs 
to give 34. 

The discovery of the dipyrromethane cofactor was clearly a great step 
forward in our understanding of the mechanism of HMBS. It did, however, beg 
the question: to what is the cofactor attached? This question was answered by 
NMR spectroscopy on ^^C-labelled cofactor. Two methods were found to label 
the CO factor: treatment of the native enzyme with acid cleaved off the co factor 
and incubation of the resulting apoenzyme with [11-^^C]PBG then regenerated 
the cofactor in labelled form [34]; alternatively, growing the producing organism 
in the presence of labelled ALA also produced the cofactor in labelled 
form[35-37]. These labelling experiments allowed the structure to be unambi- 
guously defined as 32, consisting of two molecules of PBG coupled together and 
attached to the enzyme via a thioether linkage. Scheme 12. Protein degradation 
studies in Cambridge [38] and Southampton [36] and site-directed mutagenesis 
experiments in Texas [39] all identified the site of attachment as cysteine-242 
and the way the cofactor is involved in the stepwise building of hydroxymethyl- 
bilane 30 is shown in Scheme 12. More recently it has been demonstrated that 
regeneration of the cofactor occurs much more efficiently by incubation of the 
apoenzyme with hydroxymethylbilane 30 rather than with PBG [40]. This pre- 
sumably leads directly to the enzyme-PBG 2 complex 36, Scheme 12, which can 
then continue the normal enzymic reaction, adding two more PBG molecules to 
give the hexapyrrolic enzyme-PBG 4 complex 37. Release of another molecule of 
hydroxymethylbilane would then restore the enzyme to its native form 32. 
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Scheme 11. Reaction of the dipyrromethane cofactor with Ehrlich’s reagent 
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Scheme 12. The mechanism of hydroxy methylbilane synthase 



Subsequent to the work just described, all the details were fully confirmed 
and many more were revealed by X-ray crystal structures of HMDS, Fig. 1 
[41-44]. The crystal structures show an enzyme consisting of three domains; 
between domains 1 and 2 is a cavity partly occupied by the dipyrrole cofactor, 
which is attached to cysteine-242 in the third domain. The nearly planar struc- 
ture of the dipyrrole in the initially reported crystal structure [41] made it ap- 
parent, however, that the dipyrromethane had become oxidised during handling 
to a species in which the two pyrroles were part of a single conjugated rr-system 
(probably a mixture of dipyrromethene and dipyrromethenone). A second 
crystal structure was subsequently obtained of the enzyme with the natural 
reduced cofactor [42, 43]. In this latter structure the first ring of the cofactor is 
in a very similar position to that of the oxidised cofactor but the second ring is 
tilted back into the cavity with an angle of 61° between the planes of the two 
rings. Most recently the structure of the selenomethionine-substituted enzyme 
with the active reduced cofactor has been solved totally independently using 
multiwavelength anomalous dispersion (MAD). It is this structure that is shown 
in Fig. I. 

Significant features of the enzyme revealed by these crystal structures are that 
the carboxylate group of aspartate-84 is hydrogen bonded to both pyrrolic N-H 
groups of the cofactor and all but one of the carboxyl groups of the cofactor, in 
both its reduced and oxidised forms, find good ion pairs with the guanidinium 
groups of arginine residues. The one exception is the propionate side-chain of 
ring 2 of the reduced cofactor, which lacks any direct contact with a positively 
charged counter-ion but does form a water-mediated hydrogen bond with a 
more distant arginine residue, Arg- 176 [45]. This may well be very significant in 
the mechanism of the reaction, as will be described below. 

The importance of all the active-site groups mentioned above has been tested 
by site-directed mutagenesis. Before the crystal structure was available, a num- 
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Fig.1. Ribbon representation of hydroxymethylbilane synthase from E. coli [44]. Domains 1,2 
and 3 are in yellow, blue and magenta respectively. The cofactor, in green, is here in its active 
reduced form 



ber of arginine residues had been changed to either histidine [46] or leucine [47] 
residues. Mutation of arginine residues 131 and 132, which bind ring 1 of the 
cofactor, prevents cofactor assembly and so gives inactive enzyme. Mutations of 
arginines 11,149, 155 and 1 76 do not prevent assembly of the cofactor but either 
prevent or markedly inhibit the later steps of covalent attachment of PEG. 
Interestingly, the Argl55Leu mutant appears to be unable to release the tetra- 
pyrrole product and so accumulates the enzyme-PBG 4 complex, which is not 
observed with wild-type E. coli HMBS. Mutation of other arginine residues (e. g. 
Arg-101 and 232) have much less effect on the enzymic activity. 

Other residues that have been mutated are cysteine-242, aspartate-84, lysine-55 
and lysine-59. Changing cysteine-242, the site of attachment of the cofactor, to a 
serine residue removes essentially all of the catalytic activity by preventing co- 
factor assembly. When aspartate-84 was changed to alanine or asparagine, all 
activity was lost, though assembly of the cofactor did still occur [48]. Changing 
aspartate-84 to a glutamate (i. e. addition of just one CH 2 ) causes loss of 99 % of the 
enzymic activity. The crystal structure of this mutant shows that the carboxylate 
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group of the glutamate residue does not hydrogen bond to the pyrrole-NH of the 
second ring of the cofactor, which presumably accounts for its low activity [45]. 
Lysines 55 and 59 had been shown to be in or near the active site by chemical modi- 
fication studies using PLP and NaBH 4 [49]. Replacement of lysine-55 by glutamine 
has almost no effect on the enzymic activity, however, whereas replacement of 
lysine-59 has no effect on k^-at but increases the for PBG approximately 30-fold 
[50], indicating that it plays some part in binding the substrate but not in the cata- 
lytic reaction. In the crystal structure, lysine-55 is in a region of the protein which 
could not be located, presumably because it is mobile or disordered and lysine-59 
is just on the edge of this region and its side-chain was not located. It is possible that 
this region is a loop that closes over the active-site when the substrate is bound. 

It seems from the crystal structures that there are three binding sites for pyr- 
rolic rings. The reduced cofactor occupies two of them (sites Cl and C2) and it 
is likely that the third one, which is occupied by ring 2 of the oxidised cofactor, 
would normally be the substrate binding site (site S). The substrate must bind 
the other way round from the orientation observed for ring 2 of the oxidised 
cofactor, however, because the aminomethyl carbon, C-1 1 of PBG, must become 
bonded to C-9 at the end of the dipyrromethane cofactor. With this restriction 
in mind, we have modelled the binding of the substrate in this binding site using 
molecular mechanics to predict the energies of various different orientations 
[51 ] . The lowest energy orientation, shown in Fig. 2, has a number of interesting 




Fig. 2. Close up of the active site of HMBS with the first substrate molecule (PBG) in its pro- 
posed non-covalently bound orientation 
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features. The pyrrole ring of the substrate is significantly further forward than 
ring 2 of the oxidised cofactor and is also turned sideways so that, instead of 
having a face-to-face interaction with the phenyl ring of phenylalanine-62, it 
now has an edge-to-face interaction, with 2-H of PEG pointing towards the 
centre of the phenyl ring. This edge-to-face interaction probably both favours 
formation of the intermediate azafulvenium ion 38, Scheme 13, and protects it 
from attack by nucleophiles such as water. Despite this change of orientation 
compared to ring 2 of the oxidised cofactor, the side-chains of the bound PEG 
still make good ion pairs with the same arginine residues (Arg-1 1 and Arg-149). 
A most significant feature is that, whereas in solution PEG forms an intramole- 
cular ion pair between its protonated amino group and the carboxylate of the 
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Scheme 13. Proposed details of the mechanism for the first cycle of HMBS 
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neighbouring acetate side-chain [52], in its predicted enzyme-bound orienta- 
tion the acetate carboxylate swings forwards to ion-pair with arginine- 11 and 
the protonated amine swings backwards to ion-pair with the carboxylate of the 
propionate side-chain on ring 2 of the reduced cofactor. This immediately 
explains why this carboxylate lacks a direct ion-pair in the native enzyme and 
why the protein forces this side-chain to adopt a gauche conformation rather 
than the preferred anti conformation (see Fig. 2). The protonated amino group 
is now nearly perpendicular to the plane of the pyrrole ring, in the correct posi- 
tion to allow expulsion of ammonia to form the azafulvenium ion 38, Scheme 13. 
Carbon-carbon bond formation can then take place to give intermediate 39, in 
which ring 2 of the cofactor is a protonated pyrrolenine. Deprotonation at C-9 of 
the cofactor must then take place to restore this ring to being a pyrrole, as in 35. 
However, inspection of the crystal structure reveals no enzymic residue capable 
of effecting this deprotonation. Instead the carboxylate group of the propionate 
side-chain is perfectly situated to remove the proton from C-9 and having lost 
its counterion when ammonia was released, it would be a suitably basic group. 

The steps shown in Scheme 13 complete the first cycle of covalent attachment 
of PEG to the dipyrromethane cofactor, giving the tripyrrole 35. This cycle now 
has to be repeated three more times. Given the excellent orientation of the 
binding and catalytic groups in the three pyrrolic binding sites identified above 
and the lack of any further such groups nearby, it seems likely that the whole 
pyrrolic chain moves along so that the second ring of the cofactor occupies site 
Cl, previously occupied by the first ring of the cofactor, and the now covalently 
attached substrate ring occupies site C2, previously occupied by the second ring 
of the cofactor. This would then vacate the substrate binding site ready for the 
next substrate to enter. This movement is probably made favourable by the strain 
generated as the central ring becomes a planar pyrrole in the deprotonation step 
39 ^ 35. It is conceivable that the movement of the growing pyrrolic chain is 
made possible by a conformational change of the protein in which domain 3 
swings away from domains 1 and 2 (see also below). There is evidence that such 
a conformational change does take place because the native enzyme is relatively 
unreactive towards thiol-modifying reagents such as methyl methanethio- 
sulphonate [53] and M-ethylmaleimide [54] but it becomes increasingly more 
susceptible to inactivation as increasing numbers of substrate molecules be- 
come bound. The residue which is modified when this inactivation occurs has 
been identified as cysteine-134, which is located on domain 2 at the interface 
with domain 3 [54]. 

After four PEG molecules have been attached, giving hexapyrrole 37, 
Scheme 12, they must be detached to produce the hydroxymethylbilane 30 and 
regenerate the dipyrrolic cofactor 32. Mechanistically, this is the reverse of the 
very first attachment step and it presumably occurs in the same site. In other 
words the two cofactor rings return to their original binding sites. Cl and C2, 
and ring A occupies the substrate binding site, S. Protonation at C-9 on cofactor 
ring 2 then occurs (equivalent to 35 ^ 39) followed by C-C bond cleavage 
(equivalent to 39 ^ 38) and capture of the resulting azafulvene by water. If we 
assume that this mechanism is the exact reverse of the first attachment step then 
the stereochemistry at the methylene group involved will be the same, i. e. either 
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both occur with retention of configuration (as predicted in Scheme 13) or both 
occur with inversion. Either way, the resulting overall stereochemistry at the 
hydroxymethyl carbon of hydroxymethylbilane 30 would be retention of con- 
figuration. This has been proved experimentally by incubation of [ 1 1 and 
[11S-^H]PBG with HMDS followed by chemical degradation of the hydroxyme- 
thylbilane produced to glycolic acid and enzymic analysis [55] . It is known, from 
enzymic conversion of the hydroxymethylbilane through to protoporphyrin IX 
11, that the reactions forming each of the three interpyrrolic methylene groups 
are also stereospecific [1, 2, 28]. However the stereochemistries of these steps 
have not yet been discovered. 

It would be of great interest to obtain crystal structures of the HMBS-sub- 
strate complexes so that the nature of the conformational change that occurs as 
the polypyrrole chain grows can be observed directly. Unfortunately these com- 
plexes, while stable enough to be separated by chromatography, are not in- 
definitely stable and break down by release of monopyrrole units during the 
crystallisation process. A promising alternative approach, however, is to diffuse 
PBG into the crystals and then obtain crystal structures at various time intervals 
using the Laue method, which involves a very brief irradiation from an intense 
synchrotron X-ray source. This technique is currently being applied to HMBS to 
study the build-up of the enzyme-substrate complexes and the preliminary 
results are extremely encouraging [56]. For example it is clear that if domain 3 
does move, the shift is not sufficient in the early stages to disrupt the crystal 
lattice. Also the indications are that aspartate-84 does not significantly change 
position as the PBG is bound, which fits in with the foregoing mechanistic 
scheme. 

Because it has not yet been possible to crystallise the complexes of HMBS 
with the natural substrate, there has been considerable interest in analogues of 
PBG, which might form more stable complexes. A number of analogues have 
been tested in which PBG has altered side-chains [57]. In quite a few cases the 
substrate analogues can bind covalently to the cofactor but cannot complete the 
reaction. An example of this is 2-bromoPBG 40, Scheme 14, which becomes cova- 
lently attached to the native enzyme, enzyme-PBG and enzyme-PBG 2 complexes 
and blocks any further reaction, resulting in effectively irreversible inhibition 
[39, 58]. The 2-fluoro-ll -hydroxy analogue 41 of PBG behaves similarly [59] - a 
hydroxyl group at C-1 1 is known to be displaced almost as easily as the normal 
amino group, making 42 a good substrate for HMBS [60]. 




41 X = F, Y = OH 

42 X = H, Y = OH 

Scheme 14. Some inhibitors and unnatural substrates of HMBS 
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Of all the analogues of PEG that have been tested, only two, 9-fluoroPBG 43 
and the phosphonate analogue 44, have been shown to be able to complete the 
whole reaction, producing a modified hydroxymethylbilane [61]. The turnover 
rates for these two compounds as sole substrates were approximately 100 and 
500 times slower respectively than the rate for PEG (6 min~^). For the fluoroPEG 
a Km value of 108 pM was measured. When 9-fluoroPEG was tested as an inhi- 
bitor of the normal turnover of PEG, an apparent Kj value of 6 pM was measu- 
red. This is surprisingly low because normally the value for a slow substrate 
is the same as its Kj value as a competitive inhibitor. Closer inspection of the 
kinetic data revealed that the graph of rate vs [PEG] in the presence of the inhi- 
bitor is not the usual hyperbolic curve but is sigmoid at low [PEG]. The reason 
for this and for the low apparent Kj seems to be that after one molecule of flu- 
oroPEG has covalently bound to the enzyme, the binding of the next molecule of 
PEG (or of fluoroPEG) is affected and a higher concentration is required to 
achieve half-maximal rate. 

The fact that the maximum turnover rate for 9-fluoroPEG is so much lower 
than for PEG is also something of a surprise because at first sight it would seem 
that the modification is a minor one and is well removed from the part of the 
molecule where the reaction occurs. However, if the mechanistic hypothesis pre- 
sented in Scheme 13 is correct, then this propionate side-chain will play a vital 
role in the deprotonation step, analogous to 39 ^ 35, in the subsequent round of 
addition of a substrate molecule. The introduction of a fluorine atom a to a car- 
boxyl group lowers its pK^ substantially (e.g. pK^ for CH3CO2H is 4.76 and for 
FCH2CO2H is 2.59) and so the carboxylate anion is a weaker base and the depro- 
tonation step would be slower. The involvement of this propionate side-chain 
both in binding the next substrate molecule and as the base for a key step in the 
mechanism would also explain why modified PEG molecules in which the 
length of this side-chain is altered or the carboxyl group is missing are general- 
ly able to bind covalently to the enzyme but the complexes so formed can only 
proceed with attachment of a further pyrrole very slowly if at all [62, 63]. 



2.5 

Uroporphyrinogen III Synthase (Cosynthetase) 

The earliest knowledge of the nature of the rearrangement catalysed by uro’gen 
III synthase (UROS) came with the incorporation of [2,11-^^C2]PEG 16a 
(derived from [5-^^C] ALA) diluted with unlabelled PEG into uro’gen III 10a and 
thence into protoporphyrin 11a [1, 2, 28]. The pattern of labelling observed by 
NMR analysis is shown in Scheme 15 a. The key point to note is that two 
labels from the same PEG molecule become adjacent at C-15 and C-16, with no 
mixing with the unlabelled PEG. This shows that at some point during the pro- 
cess the pyrrole ring that provides ring D has been detached from its methylene 
group, C-11, turned round, and reattached by an intramolecular process. If 
uro’gen III is derived from [5-^^C]ALA without dilution with unlabelled ALA, 
then all the sites marked in 10a will be ^^C-labelled. Thus in the NMR 
spectrum all the enriched peaks appear as doublets except for G-15 which is a 
triplet and C-20 which is a singlet. This signature has been used many times to 
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'®C at either ♦ or A or ■ or • 




Scheme IS a, b. Labelling experiments to define the nature of the rearrangement catalysed by 
UROS 



identify tetrapyrrolic compounds derived from uro’gen III and to assist in the 
assignment of the NMR signals. 

The same conclusion with regard to the rearrangement can be drawn from 
two experiments using labelled aminomethylbilanes 45 b and 45 c, Scheme 15 b, 
but in addition these experiments showed that (i) the rearrangement occurs at 
the bilane stage and (ii) it occurs intramolecularly within the bilane. The amino- 
methylbilane 45 a is a product formed by HMBS when high concentrations of 
ammonia are present. It is not a substrate for UROS but is converted by HMBS 
into hydroxymethylbilane 30, the normal substrate for UROS. 

A number of modified hydroxymethylbilanes have been synthesised and 
tested as substrates for UROS [64-66]. These experiments showed that if the 
bilane is accepted as a substrate, the resulting porphyrinogen usually has ring D 
predominantly rearranged but a lesser amount is cyclised enzymically without 
rearrangement. Even in the case of the hydroxymethylbilane having ring D 
already rearranged this ring is rearranged back again by the enzyme 45 % of the 
time, giving uro’gen I, whereas 55% of the time the enzymic cyclisation occurs 
without rearrangement to give uro’gen III [64]. 

A likely mechanism for turning round ring D involves reaction of some 
electrophile at C-16 of the hydroxymethylbilane. This would then permit a 
fragmentation of the C-15/C-16 bond, allowing ring D to be turned over and 
reattached via its alternative a-position. In principle the electrophile could be a 
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proton or an iminium ion. However, the hydroxymethyl carbon of the substrate 
can generate a good electrophile and a bond needs to be made between this 
carbon and C-16 at some point during the mechanism. Therefore the most 
economical mechanism is that shown in Scheme 16. This has been termed the 
“spiro mechanism” because its key intermediate is the spiropyrrolenine 46. 

When the spiro mechanism was first proposed in 1961 [67], it was thought 
that the macrocyclic portion of the spiro intermediate 46 would be too strained 
and it was suggested that each pyrrole ring would have to be protonated on one 
of its a-carbons to engender greater flexibility in the macrocycle. In order to test 
whether this proposal is necessary, it was decided to aim to synthesise com- 
pounds having the same tripyrrolic macrocycle. Several examples of this type of 
macrocycle were made in Cambridge, all of which proved to be stable compounds 
[68]. Of the compounds synthesised, the one closest in structure to the spiropyr- 
rolenine 46 was spirolactam 48 [69] , which differs only in the replacement of the 
imine by an amide in the five-membered ring (Scheme 17). This spirolactam 
proved to be a potent inhibitor of UROS, binding about ten times more strongly 
than the substrate. The inhibition by spirolactam 48 strongly suggests that a 
closely related structure (i. e. spiropyrrolenine 46) is involved in the mechanism. 




HO- 

A P 

Hydroxymethylbilane 30 

A = CH2C02H, P^CHzCHaCOzH 

Scheme 16. The spiro mechanism for uro’gene III synthase 





Uro'gen III 10 




Scheme 17. Synthesis and absolute configuration of the spirolactam inhibitor of UROS 
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Further evidence for the specificity of the interaction of UROS with the 
spirolactam 48 came when its two enantiomers were separately prepared by 
including a resolution step halfway through the synthesis [70]. One of the two 
enantiomers inhibited UROS at least 20 times more strongly than the other. The 
unsubstituted parent spiro ring system of 48 does in fact have a plane of sym- 
metry and the difference between the enantiomers lies only in the arrangement 
of the acetate and propionate side-chains on the three pyrrolic rings of the 
macrocycle. This makes the discrimination shown by the enzyme between the 
two enantiomers all the more impressive. 

It was not initially possible to assign an absolute configuration to the 
strongly inhibiting enantiomer of spirolactam 48 and it took a good deal of 
work over a number of years before this problem was finally solved [71]. A 
large number of chiral derivatives were prepared related to the optically active 
dipyrrolyl lactam 47 from which spirolactam 48 had been synthesised. Un- 
fortunately none of these compounds proved to be crystalline and eventually 
attention turned to monopyrrolyl lactams such as 49. A derivative that was 
found to be highly crystalline and gave good X-ray data was the racemic N- 
nitrosolactam 51 . However, the individual enantiomers of this same compound 
failed to crystallise. It seemed that in this series the chances of crystallising a 
racemic mixture were higher than for a single enantiomer. Accordingly a novel 
approach was adopted in which the acid produced by hydrogenation of (-)-49 
was esterified with (R)-l-phenylethanol and its enantiomer was esterified 
with (S)-l-phenylethanol. The two products, which are the two enantiomers of 
a single diastereoisomer (50 or its epimer), were then mixed to give the race- 
mate and nitrosated. The resulting racemic nitrosolactam crystallised well and 
at long last it was possible to obtain a crystal structure, which showed that the 
product, though racemic, had the relative configuration 52 shown in 
Scheme 17. Thus the (R)-l-phenylethyl group had been attached to the lactam 
with the R configuration and it follows, therefore, that (-)-49 has the R con- 
figuration. 

Finally, correlation of the absolute configuration of monopyrrolyl lactam (R)- 
49 with that of dipyrrolyl lactam 47 was achieved by comparison of the circular 
dichroism spectrum of (R)-49 with the spectra of various dipyrrolic lactams 
derived from 47. The end result was that the strongly inhibiting enantiomer of 
spirolactam 48 also has the R configuration, as shown in Scheme 17 [71]. We can, 
therefore, be confident that if the spiropyrrolenine 46 is the natural intermediate, 
it also has the R configuration, shown in Scheme 16. 

Apart from the inhibition by spirolactam 48, enzymic studies on UROS have 
revealed little information on the mechanism of the reaction. The gene sequence 
is known for a number of different organisms and reveals very little homology 
between the amino acid sequences in different organisms. This is in contrast to 
HMDS in which large regions of high homology can be identified for organisms 
as different as E. coli and man. As a result no catalytic groups have been identi- 
fied. The presence of lysine and arginine groups in the active site has been infer- 
red from chemical modification studies [1,2, 28] but it is likely that these are the 
counterions for binding the anionic carboxylate groups of the substrate. A num- 
ber of mutations which affect the activity of human UROS have been discover- 
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ed by studying patients with congenital erythropoietic porphyria in whom the 
enzyme is defective [72]. 

Proposals for the detailed three-dimensional mechanism of UROS have been 
put forward [73, 74]. The mechanism really only requires one acidic group to 
protonate the HO group in the initial step and one basic group to perform the 
final deprotonation. It is possible that the same enzymic residue could perform 
both functions. Apart from this, the architecture of the active site must be such 
that, for the natural substrate 30, it favours the turning round of ring D and 
prevents the direct cyclisation to give uro’gen I. 

Uro’gen III synthases from most sources are particularly unstable and can be 
totally inactivated by a brief heat-treatment which leaves HMBS fully active. 
This instability would frustrate any attempts to crystallise the enzyme. However, 
it has recently been reported that the UROS from B. subtilis is much more stable 
than from other sources [75] and this opens up the possibility of crystallisation 
and an X-ray crystal structure determination. 

3 

Biosynthesis of Hydrogenobyrinic Acid in Pseudomonas denitrificans 



3.1 

Introduction 

Progress on the biosynthetic pathway from uro’gen III 10 to vitamin B 12 3 and 
coenzyme B 12 4 during the last 5-10 years has come almost entirely from re- 
search involving the aerobic B 12 -producer Pseudomonas denitrificans. There 
have been dramatic and surprising developments from the work on this orga- 
nism that have led to the elucidation of the complete biosynthetic pathway to 
vitamin Bj 2 . This rapid surge came largely from collaborative research carried 
out in Paris and in Cambridge. A detailed review of this vast effort has been 
published [76] and so in this review we will only highlight several key elements 
that pick out this work as a new departure in the way studies can be made of the 
biosynthesis of a complex molecule. Also, we will describe in some detail how a 
selection of the key biosynthetic intermediates were discovered and how their 
structures were determined on a micro-scale. The research on the other inter- 
mediates, not selected for full description, followed very similar lines and these 
structures are, of course, included in the presentation of the entire pathway to 
vitamin Bj 2 3 that forms the climax to this chapter. Schemes 29, 30 and 33. 



3.2 

Earlier Researches Leading to the Recent Advances 

Over the past 50 years or so, there has been a transformation in our knowledge 
of how natural substances are biosynthesised. This was achieved largely by 
methods based on isotopic labelling and generally involved the following steps: 
(i) discover what simple building blocks are used and where they fit into the final 
structure, (ii) from the pointers so gained, postulate likely advanced inter- 
mediates on the pathway, (iii) synthesise these putative intermediates in labelled 
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form to test whether they are transformed into the final product by the living 
system (or an extract from it), (iv) use the clues from (ii) also to search directly 
for new intermediates by analytical screening of the contents of the organism of 
interest. It was a very powerful approach that was also used in research on 
vitamin B 12 up to around the mid-1980s. The organism selected for that work 
was a micro-aerophilic one, Propionibacterium shermanii and without giving 
any details of that major effort, it showed that uro’gen III is the parent macrocy- 
cle used to construct vitamin Bj 2 in addition to the chlorophylls and haems. The 
step that triggers a branch in the biosynthetic pathway away from chlorophyll 
and haem towards vitamin Bj 2 is C-methylation with S-adenosyl-L-methionine 
(SAM) being the methylating agent. 

More C-methylations follow this first one and the research during this period 
also set in place the next three intermediates beyond uro’gen III 10 en route to 
vitamin B 12 4, Scheme 18 [1, 28, 77, 78]. They are precorrin-1 53 (or a tautomer 
of this structure), precorrin-2 12 and precorrin-3A 55; they can be seen as the 
mono-, di- and tri-methylation products of uro’gen III 10, Scheme 18. The 
figures after the name “precorrin” correspond to the number of C-methyl groups 
introduced into uro’gen III from SAM to generate that intermediate; the reason 
for using 3A rather than just 3 will become clear later. When labelled forms of 
precorrin-2 12 and precorrin-3A 55 were incubated with a crude cell-free en- 
zyme preparation from Pr. shermanii, they were converted into cobyrinic acid 
58, a late precursor of vitamin B 12 [1, 28, 77, 78]. 

The three foregoing intermediates 53, 12 and 55 are all readily oxidised to the 
corresponding aromatic chlorins or isobacteriochlorins when handled in air. 
Thus, 53 affords Factor 1 54, 12 gives sirohydrochlorin 56 and 55 is converted into 
Factor III 57. In fact, most of the research leading to the structures of these three 
precorrins was carried out on the stable dehydrogenated products [ 1, 28, 77, 78] . 
Fortunately, the aromatic systems 56 and 57 could be reduced either enzymical- 
ly or by catalytic hydrogenation to return to precorrin-2 12 and precorrin-3A 55, 
respectively [1, 28, 77, 78]. It should also be noted that sirohydrochlorin 56 is the 
metal-free macrocycle corresponding to sirohaem 7 [6-8], Scheme 1. 

Despite large efforts to isolate new intermediates from Pr. shermanii lying 
beyond precorrin-3A 55 on the pathway to B 12 , none were found. Also, following 
the approaches (i)-(iv) above, some putative later intermediates were prepared 
in labelled form but were found not to act as precursors of vitamin B 12 - It was 
clear that there were simply too many possible ways for the true pathway 
to go forward from precorrin-3A 55 to make this a realistic approach. A fresh 
approach was needed and the way the rapid advances over the past decade or so 
were achieved will be described in the remainder of this chapter. 



3.3 

Synergistic Combination of Biology, Chemistry and Spectroscopy 

Three of the most important changes that were made in the approach to the prob- 
lem were: (a) to add the power of genetics and molecular biology to the armory 
of other methods that had previously been used, (b) to use a different organism, 
the aerobic Bi 2 -producer Pseudomonas denitrificans, (c) to apply extensive 





Biosynthesis of Vitamin 



167 



Uro’gon III 10 





Scheme 18. Earlier knowledge of the pathway to vitamin Bjj in Pr. shermanii 

multiple labelling with carbon- 13 and then to harness the latest advances in 
pulsed NMR for determinations of structure. 

Step (a) was crucially important in that now the primary focus was on the 
enzymes catalysing the various conversions of one intermediate into the next; this 
then led forward logically to detection and isolation of the biosynthetic inter- 
mediates. The earlier approach had generally been the other way round. 
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The genes of Ps. denitrificans involved in the biosynthesis of coenzyme Bj 2 4 
are known as cob genes and the corresponding proteins that they encode are 
the Cob enzymes. This work involved analysis of a huge number of mutants 
(ca. 22000) of two aerobic Bi 2 -producing organisms, similar to Ps. denitrificans, 
for their ability to synthesise cobalamin [79]. As a result, 174 mutants were 
identified which could not carry out this synthesis, i.e. they lacked at least one of 
the set of genes needed for the biosynthesis of 4. The next step was to clone the 
Ps. denitrificans cob genes that restored cobalamin biosynthesis in these Cob 
mutants (“complementation”). A genomic library was constructed for this pur- 
pose which consisted of ca. 3600 separate strains of E. coli, each with a plasmid 
carrying a different DNA insert cut out from the Ps. denitrificans genome, with 
an average size of 1 3 Kb. The plasmid was capable of being transferred (or mobi- 
lised) into a range of other bacteria. The plasmid carried by every strain in the 
library was then mobilised in turn into each of the Bi 2 -blocked mutants above to 
see which plasmids restored their ability to biosynthesise cobalamin. Eleven 
plasmids from the library achieved this restoration for most of the blocked 
mutants. The 11 plasmids together carried 78 Kb of DNA from Ps. denitrificans. 
Further genetic analysis [80-84] of this 78 Kb of DNA in similar ways eventu- 
ally led to the identification of 22 cob genes that were needed for the biosynthesis 
of coenzyme B 12 4. They were named cobA to cobQ and cobS to cobW. The 
nucleotide sequence was then determined for 35.9 Kb out of the 78 Kb of DNA, 
including all 22 cob genes. These genes were organised in four clusters, some- 
what scattered, on the Ps. denitrificans genome. It was then possible to predict 
the amino acid sequences of the corresponding Cob proteins from the nucleo- 
tide sequences of the cob genes. 

Even this brief and somewhat simplified outline of these researches makes 
their great importance clear. At this stage, the identification of the 22 cob genes 
showed that 22 proteins (they turned out mostly to be enzymes) are involved in 
the biosynthesis of co enzyme Bj 2 4. Comparison of the amino acid sequences of 
the Cob proteins with sequences in protein databases led, for example, to identi- 
fication of the probable methyltransferases but apart from this the function of 
none of them was known nor was it known at what stage any of them acted in 
the building process. The untangling of this complex puzzle demanded sub- 
stantial quantities of the Cob proteins. Insertion of a particular gene, or set of 
genes, into an appropriate multicopy plasmid led to overexpression of the gene, 
so yielding the required material. Beyond this point, it was the turn of enzy- 
mology and analytical, structural and synthetic chemistry together with 
spectroscopy to play their complementary and essential part. The message 
deserves special highlighting that it was the combination of genetics, biology and 
chemistry that led to research on B 12 -biosynthesis moving forward successfully 
with a speed and completeness beyond anyone’s expectations. 



3.4 

First Steps of the Pathway in Ps. denitrificans 

It was soon found that the early stages of B 12 -biosynthesis in Ps. denitrificans 
follow those already established in Pr. shermanii. Here again, uro’gen III 10 is 
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methylated at C-2 and C-7 to form precorrin-2 12, Scheme 18; the double methy- 
lation is catalysed by a single enzyme, S-adenosyl-L-methionine:uroporphy- 
rinogen III methyltransferase (SUMT), encoded by the cobA gene. This enzyme 
was purified and characterised [85] , as was also that responsible for methylation 
of 12 at C-20 to form precorrin-3A 55, named S-adenosyl-L-methionine: pre- 
corrin-2 methyltransferase (SP 2 MT) and encoded by cobi [86]. The stereo- 
chemistry of the deprotonation steps at C-5, -10 and -20 during the conversion 
of 10 into 12 has been probed by incorporation of PEG chirally deuterated at 
C- 1 1 [87] . The result was the same whether (lli?)or(llS)-[ll -^H] PEG was used: 
all the deuterium is washed out at both C-5 and C-20, whereas most of the 
deuterium is retained at C-10. It is likely that the washout occurs by imine-ena- 
mine tautomerisation at intermediate stages of the process and that the final 
deprotonation at C-10 to form precorrin-2 12 is a non-stereospecific, non-enzy- 
mic process which largely retains the deuterium atom due to a normal isotope 
effect [87]. 

Over the years there had been no progress in identifying intermediates in 
Pr. shermanii between precorrin-3A 55 and cobyrinic acid 58 and at the outset 
for Ps. denitrificans the long pathway stretching forward from 55 to coenzyme 
Ej 2 4 was also unmapped. Detection of all the missing intermediates in Ps. 
denitrificans was made in a number of different ways; these will be illustrated by 
the examples selected for review and in each case the key element or elements of 
the successful approach will be italicised. Also, the central strategies for the 
structural and mechanistic research will be picked out in the same way. 



3.5 

Detection of Precorrin-6A and Elucidation of its Structure 

Many recombinant strains of Ps. denitrificans were generated in which various 
cob genes had been amplified. One of these overexpressed a complete cluster of 
the eight cob genes F, G, H, I, J, K, L and M and, intriguingly, it produced a sub- 
stance that showed a strong yellow fluorescence under ultraviolet light. This 
material was isolated and shown to be hydrogenobyrinic acid 60 [88], 
Scheme 19, the cobalt- free analogue of cobyrinic acid 58, Scheme 18. This pro- 




Scheme 19. Biosynthesis of hydrogenobyrinic acid from precorrin-3A 
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duct was ideal for biosynthetic studies of the B 12 -pathway in that it represented 
the complete macrocycle of the vitamin 3 and it had the advantage for handling 
of being metal-free. Importantly, a cell-free enzyme preparation from the above 
strain, rich in the eight enzymes encoded by the eight cob genes, efficiently 
transformed precorrin-3A 55 through all the many steps into hydrogenobyrinic 
acid 60 [89]. 

Two cofactors were found to be essential for the production of hydrogen- 
obyrinic acid 60 from precorrin-3A 55, namely SAM, as would be expected, but 
also reduced nicotinamide adenine dinucleotide phosphate (NADPH, partial 
structure 59) which was surprising. Scheme 19. Omission of NADPH from the 
incubation gave a critically important result; no 60 was formed but a new 
pale-yellow product appeared in its place. When a labelled form of this new 
pigment was incubated with the enzyme system, now with NADPH included, it 
was specifically converted into hydrogenobyrinic acid 60 in high yield. Clearly, 
a new intermediate for Bi 2 -biosynthesis had been found which opened the door 
to dramatic progress [89]. 

Appropriate labelling experiments by the French group [89] showed that pro- 
duction of the new intermediate involved three more methyl groups being 
added from SAM to precorrin-3A 55. The new substance was thus precorrin-6A 
and there were strong indications that precorrin-6A had a contracted macro- 
cycle. Further, by labelling the three new methyl groups of precorrin-6A with 
it was shown that these appeared at C-17, C-12a and C-1 of the hydro- 
genobyrinic acid 60 biosynthesised from it. Scheme 19. Mass spectrometry 
measurements on the methyl ester of precorrin-6A gave two unexpected results; 
(a) precorrin-6A was found to be an octacarboxylic acid, showing that the C-12 
acetate group was still intact, yet at some stage in the biosynthesis of hydro- 
genobyrinic acid it must undergo decarboxylation to form the 12jS-methyl 
group; (b) precorrin-6A corresponded in oxidation level to a dehydrocorrin 
(seven double bonds) not to a corrin (six double bonds) and therefore oxidation 
had occurred before precorrin-6A and there had to be a subsequent reduction 
to reach the final metal-free corrin 60. 

A major difficulty in elucidating the structure of precorrin-6A was that only 
small amounts (300-500 pg) of non-crystalline material was available from 
each enzymic run. This was overcome by using ^^C-labelled precursors at greater 
than 90 atom% enrichment to provide samples for analysis by pulsed NMR. 
Three samples of 5-aminolaevulinic acid, ALA 15, were synthesised, one 
^^C-labelled at C-5, the second at C-4 and the third at C-3, for enzymic conver- 
sion into three samples of precorrin-3A. Drawing on our understanding of the 
biosynthesis of uro’gen III, reviewed earlier in Sect. 2, the exact labelling pattern 
of each of these samples of precorrin-3A was known unambiguously; the 
pattern 55a for the sample from [4-^^C] ALA 15a is illustrated. Scheme 20. Each 
sample was converted enzymically into three specimens of precorrin-6A; in 
addition, [methyl-^^C]SAM was used in this last step for the experiments 
starting with [4-^^C]ALA and [3-^^C]ALA. This approach led to every atom of 
the macrocycle of precorrin-6A having been ^^C-labelled in one or other of these 
three final samples. The octamethyl ester of each one was studied by ^^C-NMR 
and by correlation spectroscopy, the latter technique picking out 
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Scheme 20. Example of '^C-labelling used for determination of the structure of precorrin-6A 



couplings through up to three bonds. This approach gave information not only 
about the nature and carbon connectivity of each labelled carbon but also 
allowed a connectivity pattern of^H and around the whole macrocycle to be 
established [90]. An independent set of connections from measurements of 
nuclear Overhauser effects interlocked with that from the couplings. 

A full account of these researches cannot be given here but the approach can 
be briefly illustrated by the experiments based on precorrin-3A 55 a from 
[4-^^C]ALA 15a and using [methyl-^^C]SAM to prepare precorrin-6A 61a, 
Scheme 20. The ^^C-NMR spectrum of the corresponding ester 62a showed 
direct connection of C-1 and C-19, thus confirming that ring-contraction had 
occurred. Also, C-19 was an sp^ centre whereas in hydrogenobyrinic acid 60 it is 
sp^, so indicating that the extra double bond in precorrin-6A 61 that is sub- 
sequently reduced is the one at C-1 8/C- 19. Very surprisingly, the NMR signals 
from the three ^^C-labelled methyl groups were all doublets showing that all 
were directly bonded to ^^C-atoms. The methyl groups at C-1 and C-17 were 
expected to be so attached, see 60, but not the third one. This was expected, a 
priori, to be sited on the unlabelled C-12, where it appears in hydrogenobyrinic 
acid 60 biosynthesised from precorrin-6A. It was clear that this methyl group 
must undergo rearrangement to C-12 in 60 from an adjacent site in precorrin- 
6A and the NMR evidence pointed to methylation at C-1 1 in the ester 62 [90]. 

That methylation had occurred at C-11 of precorrin-6A was rigorously pro- 
ved to be true by non-enzymic synthesis of [ll-^^C]uro’gen III lOd followed by 
enzymic conversion into [ll-^^C]precorrin-3A 55d. Finally, this product was 
transformed enzymically into precorrin-6A 61d using [methyl-^^C]SAM, 
Scheme 21. The ^^C NMR signal from the C-11 methyl group of the ester 62d 
was a doublet as was the signal from C-11 itself, the only ^^C in the macrocycle; 
proof of the C-11 methylation was complete [91]. 

By combining the information gained from all the foregoing experiments 
with similar data from the studies based on [5-^^C]ALA and [3-^^C]ALA, the 
surprising structure 62, Scheme 20, was established for precorrin-6A ester [90]; 
precorrin-6A itself is thus 61. The configurations at C-2, C-3 and C-7 are based 
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Precomn-6A ester 62d • = , R = Me 

Scheme 21. Proof that precorrin-6A carries a methyl group on C-11 



on the efficient enzymic conversion of precorrin-6A 61 into hydrogenobyrinic 
acid 60, where the configurations at these centres are firmly established. Also, the 
assumption was made that rearrangement of the methyl group from C-11 of 61, 
at some stage, to C-12 of 60 is intramolecular and therefore suprafacial. On this 
basis, the C-11 methyl group of precorrin-6A 61 was assigned the a-configura- 
tion since it is the C-12 a methyl group of the corrin system 58 that is derived 
from SAM [1, 28, 77, 78]. Experiments will be outlined in Sect. 3.8 that confirm 
the intramolecular nature of the rearrangement. Note that the C-11 methyl 
group blocks formation of a fully conjugated system resulting in the pale yellow 
colour of precorrin-6A. 

The structure 61 for precorrin-6A forced a complete change in the way the 
biosynthesis of vitamin Bj 2 was viewed. This structure was full of surprises: (a) 
the ring contraction step does not occur near the end of the pathway but has 
been completed at precorrin-6A 61 and is possibly carried out even earlier; (b) 
oxidation has occurred prior to the formation of precorrin-6A and reduction is 
needed after it; so the oxidation level along the pathway is not constant; (c) 
decarboxylation of the C-12 acetate group is not an early step; it occurs on the 
pathway beyond precorrin-6A; (d) the 12a-methyl group of hydrogenobyrinic 
acid 60 is introduced initially at C-11 with subsequent migration to C-12. 



3.6 

Precorrin-6B and the Regio- and Stereospecificity of Precorrin-6A Reductase 

Since precorrin-6A 61 had been accumulated by excluding NADPH from the 
incubation, it was highly probable that precorrin-6A was the substrate for a 
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reductase enzyme which had been rendered inactive by lack of its NADPH co- 
factor. Now having all the required enzymes in hand, they could be screened for 
their ability to transform precorrin-6A. It turned out that precorrin-6A was 
indeed enzymically reduced with the addition of two hydrogens to form pre- 
corrin-6 B; the letters A and B are used to distinguish two intermediates having 
the same level of methylation. Initially, a mixture of enzymes was used for this 
work in the absence of SAM to avoid further transformation of precorrin-6B 
[92] but later the pure reductase was isolated by the above screening approach 
and characterised [92]. Analysis of its amino acid sequence showed that this 
enzyme is encoded by the cobK gene. 

The structure determination for precorrin-6B followed the same approach 
outlined above for precorrin-6A 61 starting from [5-^^C]ALA and, separately, 
from [4-^^C]ALA; the latter experiment is illustrated in Scheme 22. The inter- 
mediate precorrin-3A 55 a yielded precorrin-6B 63 a and its octamethyl ester 

64 a was studied by NMR using the full plethora of techniques already described. 
As a result, the structure 64 was established for the ester and hence 63 for pre- 
corrin-6B itself [93]. To expand a little on that statement, C-19 of 63, which is 
directly bonded to C-1, was clearly an sp^ carbon whereas in precorrin-6A 61 it 
was sp^ hybridised. Thus, the C- 18/C- 19 double bond of 61 is the one that is re- 
duced in forming precorrin-6B, which then stands at the same oxidation level as 
hydrogenobyrinic acid 60. 

Now it was possible to answer two further questions: (a) which of C-1 8 or 
C-19 receives the hydride equivalent from NADPH and (b) from which face of 
NADPH 59 is the hydride delivered. The answer to (a) came from the synthesis of 
[4-^H2]NADPH with 95 atom% at C-4 59a, which was used for the enzymic 
reduction of precorrin-6A to yield ^H-labelled precorrin-6B. Analysis of this 
product by ^H-NMR and mass spectrometry then showed that the ^H had been 
transferred solely to C-19 [94], Scheme 23. On mechanistic grounds, it seems 
certain that the transfer of the hydride ion occurs to the C-18 protonated form 

65 of precorrin-6A. 




Scheme 22. Structural study of precorrin-6B by '^C-labelling 
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Scheme 23. Stereo- and regio- specificity of reduction of precorrin-6A to precorrin-6B 



Question (b) was answered by synthesising [4R-^H]NADPH 59b and 
[4S-^H]NADPH 59 c, Scheme 23, which were used in separate experiments to pro- 
duce two samples of precorrin-6B. NMR analysis as before proved that was 
only transferred to precorrin-6B 63 from 59b so the reductase specifically 
transfers of the cofactor 59 to the substrate [95]. 



3.7 

Isolation of Precorrin-8x and Proof of its Structure 

Here again, the remaining Cob enzymes were screened for their ability to use 
precorrin-SB 63 as substrate. The successful assay was based on checking for 
transfer of the methyl group from [ methyl-^H] SAM to precorrin-6B. It was likely 
that the next enzyme might be a methyltransferase because hydrogenobyrinic 
acid carries two methyl groups more than precorrin-6B. The selected enzyme 
proved to be encoded by the cobL gene and, when isolated on a preparative scale 
and purified to homogeneity, CobL smoothly transformed precorrin-6B 63 into 
a new substance carrying two more methyl groups. The new intermediate, 
named precorrin-8x, is efficiently converted enzymically into hydrogenobyrinic 
acid 60, thus establishing its status as a new Bi 2 -precursor [96, 97]. Dimethyla- 
tion by one enzyme was not by now a surprise since precorrin-2 12 is also bio- 
synthesised in this way. However, the finding by mass spectrometry that pre- 
corrin-8x is a heptacarboxylic acid was certainly unexpected [97]. This means 
that the C-12 acetate had undergone decarboxylation and so CobL acts both as 
a methyltransferase and a decarboxylase, two very different processes. It may be 
that cobL has arisen by the fusion of two ancestral genes [96] and support for this 
interpretation comes from the description of two separate genes of the B 12 
pathway in Salmonella typhimurium, cbiE and chiT, which show strong homology 
with the amino-terminal and carboxy- terminal regions of cobL, respectively [98]. 

The structural work on precorrin-8 x followed the ^^C-labelling methods alrea- 
dy fully described [99]. In this case, however, there were frustrating difficulties. 
The ester of precorrin-8 x was unstable so the NMR studies were carried out on 
the free acid. This isomerised in aqueous solution to generate first a mixture of at 
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least five isomers and eventually one stable form. Separation of the former mixtu- 
re by HPLC allowed each isomer to be tested as a biosynthetic precursor of hydro- 
genobyrinic acid 60 and it was found that the true precursor so identified could be 
stabilised at high pH for spectroscopic study. One set of experiments differed from 
the earlier examples in that the labelled starting material was [2,3-^^C2]ALA 15b, 
Scheme 24. The reason for using this particular labelling pattern was because 
information from correlation spectroscopy on the small scale involved is 

only available at and around ^^C-labelled sites (too many of the peaks overlap in 
the NMR spectrum for this to be used on its own). In this way, the nature of 
C-12 and its attached carbon in precorrin-8x could be studied [99] and these 
carbons are highlighted by using filled circles on structure 66 b in Scheme 24. 
In addition, a full ^^C-labelling study [99] on the stable form of precorrin-8x 
established its structure 67. This knowledge of the structure of an isomer of the 
true intermediate was invaluable for the subsequent research on the latter. 

The sum of all the resrdts from ^^C-NMR and ^H-NMR on both the true inter- 
mediate and its stable isomer led to structure 66 for precorrin-8x [99] and its 
key features are; (a) the two new methyl groups appear at C-5 and C-15, the lat- 
ter being an sp^ centre like C-10; (b) decarboxylation of the C-12 acetate group 
has been followed by tautomerisation to generate a methyl group at C-12; (c) 
there are several opportunities in structure 66 for double-bond tautomerism 
and also for epimerisation which probably account for the ready formation of 
the isomers reported above; (d) precorrin-8x 66 is isomeric with hydrogeno- 
byrinic acid 60 and so is poised for the final rearrangement step that generates 
the corrin macrocycle. This step is covered in the following Sect. 3.8. 



3.8 

Migration of the C-11 Methyl Group to C-12 to Form the Corrin Macrocycle 

Enzymic catalysis of a rearrangement reaction has special mechanistic 
fascination and, for vitamin B 12 , a methyl group at C- 1 1 of 66 shifts to C- 1 2 of 60, 
Scheme 25. This step clears the blockage to movement of the double bonds so 
tautomerisation can now occur, with thermodynamic gain, to form the extended 
conjugated system of hydrogenobyrinic acid 60. The enzyme responsible for 




Preoorrln-Sx 66 o = • = ’^ Stable form of pfeoorrin-8x 67 

66b o = • ='^C 



Scheme 24. Structure elucidation for precorrin-8 x 
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catalysing this rearrangement process was eventually detected and its td-terminal 
sequence showed it to be encoded by the cobHgene [97], Construction of a strain 
of Ps. denitrificans having cobH as the only amplified gene then allowed a pro- 
tocol to be developed for isolation of pure CobH protein. This turned out [97] to 
be a relatively small monomer (M^ 22000 ± 1000 ) which further increases its 
attractiveness as a target for X-ray or NMR structure determination. 

The mechanism for the migration of the methyl group from C-11 to C-12 is 
illustrated in Scheme 25 as an intramolecular [l,5]sigmatropic rearrangement. 
If this is true, the migration will be suprafacial and it was on this assumption that 
the C-11 methyl groups of precorrins -6 A, - 6 B and - 8 x were assigned the a-con- 
figuration. It was essential to gain direct evidence that this was so because hard 
experience had taught the Paris-Cambridge teams that with vitamin B 12 , one can 
assume nothing. Far too often, what had seemed likely proved to be wrong. 

The decision between an intramolecular mechanism and an intermolecular 
one was made by a cross-over experiment. [^^CgjPrecorrin-SA was prepared 
enzymically [100] from [3-^^C]ALA and part was biosynthetically converted 
into [^^C 8 ]precorrin- 6 A using unlabelled SAM. The rest was similarly converted 
into [^^C 8 ,(CD 3 ) 3 ]precorrin- 6 A by using [methyl-^H^]SAM. A mixture of equal 
quantities of the two labelled samples of precorrin- 6 A was then enzymically 
converted into hydrogenobyrinic acid using unlabelled SAM. Mass spectro- 
metric study of this final product proved that there had been no mixing of the 
CH 3 and CD 3 groups and hence that the methyl migration is intramolecular 
[101]. The illustrated [l,5]sigmatropic mechanism, Scheme 25, is thus supported 
and the a-configuration assigned to the C-11 methyl group of precorrin- 6 A 
61 and later intermediates is secure. 

It should be noted that the ^^C-labelling in the above studies was not, unlike 
earlier, for NMR analysis but simply to increase the mass so that the biosyn- 
thesised hydrogenobyrinic acid would be distinguished in the mass spectrum 
from the huge peak due to endogenous material appearing 8 mass units lower. 

The position had now been reached where the biosynthetic route to hydro- 
genobyrinic acid 60, a precursor of vitamin B 12 , had been mapped out from ALA 
through several steps to uro’gen III 10, before going forward along the “methy- 
lation pathway” to precorrin-2 12 and precorrin-3A 55. Then there was a gap to 
be filled before precorrin- 6 A 61 and subsequent intermediates. We will now 




PreoofTin-Sx 66 HydroQenobyrinic add 60 



Scheme 25. Methyl migration from C-11 to C-12 in the formation of hydrogenobyrinic acid 
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describe how this final gap was filled to complete the entire biosynthetic 
pathway to coenzyme B 12 . 



3.9 

Unexpected Structure of Precorrin-4 

The approach leading to the discovery of precorrin-4, the tetramethylated B 12 - 
intermediate, differed from those used previously. A strain ofPs. denitrificans 
was constructed having the cobM gene deleted and thus unable to produce the 
CobM enzyme. This strain yielded a cell-free enzyme system capable of con- 
verting precorrin-3A 55 into a new product, precorrin-4 [102]. Clearly, precor- 
rin-4 is the substrate for CobM but is not further transformed, and so accumu- 
lates, because that enzyme is lacking. Precorrin-4 was readily oxidised by air to 
give its dehydro derivative. Factor IV, a step that could be reversed by enzymic 
reduction. Thus when Factor IV was incubated with the complete enzyme 
system, including NADH but lacking NADPH, precorrin-4 was regenerated and 
was carried forward to precorrin-6A 61. This conversion of precorrin-4 into an 
established intermediate for B 12 sets precorrin-4 securely on the biosynthetic 
pathway to the vitamin [102]. The relationship of precorrin-4 to Factor IV 
mirrors the relationships already well known for precorrin-2 12 and precorrin- 
3 A 55 to their oxidised forms 56 and 57, Scheme 18; in these cases too the struc- 
tural work had largely been carried out on the corresponding dehydro deriva- 
tives which also could be reduced enzymically to the original intermediates. 

The structural studies [102] on Factor IV followed exactly the ^^C-labelling 
methods described already, now well developed, and led to the constitution 68, 
Scheme 26. This gave the crucial information that the formation of precorrin-4 




Scheme 26. Proof that the acetyl group of precorrin-4 is sited at C-1 
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involved an oxidative ring-contraction and C-17 methylation of precorrin-3A 
55; thus structure 69 could confidently be deduced for precorrin-4 [103]. Then 
further NMR evidence in support of structure 69 was adduced [104]. The point 
was made earlier that an enzymic oxidation step must precede the formation of 
precorrin-6A 61; the structure of precorrin-4 69 now showed that this oxidation 
had already occurred by the tetramethylated stage. Besides this interest of struc- 
ture 69, there were also two surprises, one being the unexpected early ring-con- 
traction, the other being related to the acetyl group. Though formation of an 
acetyl residue during ring-contraction was expected on mechanistic grounds, 
for the NMR evidence to place it at C-1 was a surprise. Everyone’s earlier specu- 
lations had placed the acetyl group at C- 19 (actually in a postulated intermedia- 
te which would have been much later in the pathway). Interlocking evidence for 
the C-1 acetyl group came from the non-enzymic synthesis of [1,10,20- ^^C 3 ]uro’- 
gen III lOe, which was converted enzymically via precorrin-3A into precorrin-4 
69 e. Scheme 26. The ^^C-NMR spectrum of the derived Factor IV showed 
doublets from the acetyl carbonyl group and from C-1, demonstrating direct 
bonding and thus making it certain that precorrin-4 69 carries its acetyl group 
at C-1 [105]. 

From the structure of precorrin-4 69 and its status as substrate for the CobM 
enzyme, clearly a methyltransferase from its sequence, the exact function of this 
enzyme could be deduced. In Clostridium tetanomorphum [I06-I08], Pr. 
shermanii [109] and Ps. denitrificans [88], the order of insertion of the various 
methyl groups into the corrin macrocycle, after the third one in precorrin-3 A 55, 
had been revealed by pulse labelling and this order was the same in all three. The 
fourth one was placed at position 1 7, then at 1 2 a followed by 1 and, finally, 5 and 
15. Precorrin-4 69 shows the C-17 methyl group in place and Sect. 3.5 reported 
that the 12a-methyl of the corrin system is initially inserted at C-1 1. It is that 
C-II methylation which is blocked by deletion of the cobM gene and hence 
CobM is the 11 -methyltransferase of Ps. denitrificans. 

3.10 

Oxidative Conversion of Precorrin-3A into Precorrin-3B 

The researches reviewed so far had revealed the functions of five of the eight 
enzymes overproduced by the special strain of Ps. denitrificans described 
earlier. These five were CobI, M, K, L and H, set here in the order in which they 
act on the biosynthetic pathway for hydrogenobyrinic acid 60 en route to 
vitamin B 12 3. Three enzymes were still available, CobG, CobF and CobJ, that 
together with the 11 -methyltransferase, CobM, carry out the conversion of 
precorrin-3A 55 into precorrin-6 A 61 via precorrin-4 69. So the enzymes for the 
two methylations at C-17 and at C-1 remained to be identified. 

Strains of Ps. denitrificans were constructed in which cobG, cobJ and cobF had 
been separately amplified [110]. The predicted amino acid sequences of the three 
corresponding enzymes shotved that CobF and CobJ are methyltransferases 
whereas CobG was quite different. Thus two methyltransferases were available 
for the C-17 and C-1 methylations with CobG being the remaining candidate to 
catalyse the oxidation step which precedes precorrin-4 69. 
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This analysis interlocked with the findings that (a) a strain of Ps. denitrificans 
from which cobG had been deleted accumulated precorrin-3A 55 [111] and (b) 
whereas CobJ alone had no effect on precorrin-3A [111], CobG and CobJ 
together converted it into precorrin-4 69 and hence CobG acts before CobJ 
[110]. When tested, CobG alone converted precorrin-3A 55 into a new inter- 
mediate, named precorrin-3B, generated solely by oxidation without further 
methylation [112], observations which were independently confirmed [104]. Its 
molecular weight was 16 units higher than that of precorrin-3A corresponding 
to the addition of one oxygen atom to precorrin-3A 55. The structure of pre- 
corrin-3B was determined by multiple ^^C-labelling as earlier but, in addition, a 
strong band at 1799 cm~^ in the IR-spectrum showed precorrin-3B to be a 
y-lactone. The combined evidence established structure 70, Scheme 27, for pre- 
corrin-3B [112, cf. 104]. Interestingly, the oxidation by CobG does not cause 
ring-contraction; rather the precorrin-3B molecule is primed ready for contrac- 
tion in the next step, catalysed by CobJ, together with C-17 methylation to form 
precorrin-4 69. 

The source of the oxygen atom added above to precorrin-3A was shown to 
be molecular oxygen [104] which was required to allow CobG to turn over. 
Also the oxygen atom was pinpointed as that in the G-20 hydroxyl group by 
using ^®0-isotope induced ^^C-NMR shifts [113]. Whether CobG acts as a 
mono -oxygenase or a dioxygenase was unknown as it was mechanisti- 
cally possible that a second oxygen atom had been incorporated to provide the 
lactonic oxygen attached to C-1 of 70. Because of the positioning of the 
^^C-atoms in the foregoing NMR study, this would not have been detected. 
Conversion of precorrin-3A 55 into precorrin-3B 70 and precorrin-4 69 
with ^*©2 followed by mass spectrometric study of both products showed that 
only one oxygen atom is incorporated [114]. This result together with that 
above showed that both lactonic oxygen atoms of precorrin-3B 70 are the 
original ones from the C-2 acetate side-chain of precorrin-3A 55. These and 
similar studies also showed that the C-2 acetate oxygens are also retained in 
precorrin-4 69 [114] and precorrin-5 [115], the topic of Sect. 3.11. The interest 
of these results and the following one will become clear from the discussion in 
Sect. 5. 




Scheme 27. Biosynthesis of precorrin-3B from precorrin-3A 
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Further experiments were designed to test whether the oxygens of the C-2 
acetate in precorrin-3 A 55 continue to be preserved beyond precorrin-5 through 
the remaining steps leading to hydrogenobyrinic acid 60, see Scheme 30. First, 
precorrin-3A 55 biosynthesised to carry eight groups, was converted 

into hydrogenobyrinic acid 60 by the overproduced enzymes of Ps. denitrifi- 
cans. The ^®0-induced ^^C-NMR shifts for the seven distinguishable signals 
from the carboxyl groups of the labelled 60 showed that both oxygen atoms had 
been retained in all seven carboxyls, including the important acetate at C-2 
[101]. This result was confirmed by mass measurements using electrospray 
mass spectrometry, an invaluable technique for handling such highly polar 
molecules. 

The enzyme, CobG, central to the foregoing research, is a fascinating iron- 
sulfur protein with a brown-green colour. Its content of iron and sulfur points to 
one [Fe 4 S 4 ] or two [Fe 2 S 2 ] clusters being present [110] and no cofactors besides 
oxygen need be added for full activity. 

3.11 

Formation and Structure of Precorrin-5 

By this stage, only the pentamethylated intermediate, precorrin-5, needed to 
be added to fill the last gap in the pathway to hydrogenobyrinic acid 60. As 
outlined in Sect. 3.9, conversion of precorrin-4 69 into precorrin-5 is blocked 
when CobM, the 1 1 -methyltransferase, is excluded. Also, only one enzyme, 
the methyltransferase Cob F, remains to catalyse the necessary C-1 methylation 
required to go forward from precorrin-5 to precorrin-6A 61. This knowl- 
edge led to two approaches for the production of precorrin-5, either from 
precorrin-3A 55 by incubation with CobG, CobJ, CobM and SAM but with- 
out CobF [110] or from precorrin-4 69 by using CobM and SAM [116]. 
The structure 71, Scheme 28, corresponding to ll-methyl-precorrin-4, was 
determined for precorrin-5 by the multiple ^^C-labelling approach as earlier 
[116]. 

Comparison of the structures of precorrin-5 71 and precorrin-6A 61 
raises the problem of how the C-1 methyl group of the latter is introduced. 
An attractive rationalisation is shown in Scheme 28 initiated [117] by pro- 
totropic rearrangement of precorrin-5 71 to the isomer 72, now susceptible 
to hydrolytic elimination of the acetyl group as acetic acid by a reverse 
Claisen process. The extended enamine 73 so formed then allows [118] ready 
C-methylation at C-1 to give a compound which tautomerises to the more 
stable extended amidine system [119] in precorrin-6A 61. This chemistry is 
undoubtedly enzyme-catalysed and controlled in the living organism but 
draws on the intrinsic reactivity of the system: cleavage of the acetyl group 
occurred as precorrin-5 was handled, which led to isolation of the deacetyla- 
ted product, named Factor V [ 1 1 0] . The same loss of the acetyl group was also 
observed with a derivative of precorrin-5 obtained by acid-catalysed 
esterification [116]. Finally, the eliminated fragment in the enzymic conver- 
sion of precorrin-5 71 into precorrin-6A 61 was confirmed as acetic acid by 
direct isolation [120]. 
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Scheme 28. Proposed mechanism for the introduction of the methyl group at C-1 of pre- 
corrin-6A 

4 

Complete Pathway from 5-Aminolaevulink Acid 
to Hydrogenobyrinic Acid 

All the Bi 2 -intermediates discovered as a result of the experiments outlined in 
the main part of Sect. 3 have been assembled in Scheme 30 to show the entire 
biosynthetic pathway from precorrin-3A 55 through to hydrogenobyrinic acid 
60. By any measure, it is a remarkable sequence rich in surprises and unexpect- 
ed chemistry. The early biosynthetic route from ALA 15 to precorrin-3A 55 is 
shown in Scheme 29, so Schemes 29 and 30 in combination show the complete 
story. Space only allows discussion of some of the more intriguing mechanistic 
aspects of Scheme 30; a fuller account is available [76]. 

The early steps. Scheme 29, used in common for haem, chlorophyll and vita- 
min Bi 2 , involve condensation of two molecules of ALA 15 to form PBG 16 and 
four molecules of PBG are then combined to build uro’gen III 10 by way of 
hydroxymethylbilane 30. Ring-closure of 30 to give 10 involves a fascinating 
intramolecular rearrangement. It is difficult to imagine a more compact route 
for the synthesis of uro’gen III 10. This product stands at the branch point where 
the pathway to B 12 separates from that to haem and chlorophyll, the split being 
initiated by a double C-methylation of 10 by CobA to yield precorrin-2 12. A 
further C-methylation at C-20 by Gobi generates precorrin-3A 55. All three 
methylations, and also the remaining five needed to reach hydrogenobyrinic 
acid 60 are essentially C-methylations of enamines or extended enamines. 
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Scheme 29. Complete pathway from ALA through to precorrin-3A 



Scheme 30 starts with the oxidation of precorrin-3A 55, catalysed by CobG 
again with attack at the jS-carbon of an enamine, to yield the lactonic precorrin- 
3B 70; the possible involvement of lactones in B 12 -biosynthesis was first sug- 
gested by Eschenmoser [119]. The oxidant is molecular oxygen and the single 
oxygen atom inserted forms the C-20 hydroxyl group. Precorrin-3B 70 is poised 
for a pinacol-type rearrangement [119] leading to early ring-contraction. This is 
illustrated in Scheme 30 as preceding the C-17 methylation that also occurs in 
this CobJ-catalysed generation of precorrin-4 69, which would allow the elec- 
tron-rich pyrrole nucleus to be the migrating group. The next steps leading to 
precorrin-6A have already been discussed in Sect. 3.1 1, Scheme 28, and the reac- 
tions shown there share a common feature with all the chemistry in Scheme 30, 
in that each step sets up the reactivity needed for the next one. Thus, methyla- 
tion of precorrin-4 69 by CobM at C-1 1 gives precorrin-5 71 shown in Scheme 28 
to be ready for loss of the acetyl group so allowing C-1 methylation to afford pre- 
corrin-6A 61. This was the intermediate whose discovery initiated the surge in 
knowledge of Bi 2 -biosynthesis. Reduction of 61 by CobK, almost certainly as the 
C-1 8 protonated form, sets the oxidation level of the product, precorrin-6B 63, 
to match that of hydrogenobyrinic acid 60. The unusual enzyme, Gobi, having 
both methyltransferase and decarboxylase activities, then acts on precorrin-6 B 
63 to insert methyl groups at C-5 and C-15 and to decarboxylate the C-12 acetate 
residue. It is attractive to suggest that the methylation at C-15 triggers the decar- 
boxylation by converting ring-C into a protonated pyrrolenine ideally set up for 
decarboxylation. This chemistry leads to precorrin-8x 66 having ring-C as a 
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5,5-disubstituted pyrrolenine and such systems are known to rearrange [121]. 
The rearrangement of 66, catalysed by CobH, moves the C-1 1 methyl group to 
C-12, so allowing the full conjugated system of the corrin ring of hydrogen- 
obyrinic acid 60 to develop. 

5 

Biosynthesis of Cobyrinic Acid in Propionibacterium shermanii 

Though the complete pathway to coenzyme B12 4 in the micro-aerophilic 
organism, Pr. shermanii, is not yet available, much is known and a brief com- 
parison with the fully mapped route used in the aerobic Ps. denitrificans now 
follows. “Micro-aerophilic” means “almost anaerobically” and under these con- 
ditions Pr. shermanii biosynthesises cobyrinic acid 58, Scheme 18, and some of 
its amide derivatives. The nucleotide loop, however, is constructed only in the 
presence of oxygen [122]. 

As already brought out in this review, the first part of the pathway in Pr. 
shermanii is identical to that in Ps. denitrificans from ALA 15 through six steps 
to precorrin-3A 55, Scheme 29. Then there is a difference in that cobalt is in- 
serted late in the latter organism (see Sect. 6 and Scheme 33) whereas there is 
early insertion in Pr. shermanii, most likely at the dimethylated but possibly at 
the trimethylated intermediate. This conclusion stemmed from three sets of 
experiments; one [123] studied the incorporation of cobalt complexes of the 
aromatised derivatives 56 and 57 of precorrin-2 12 and precorrin-3A 55, 
Scheme 18. The second [124] used a novel pulse-labelling technique involving 
the three isotopes and ®°Co^'^. The latter method showed, in addition, that 

the incorporation of cobalt was rapid and that the complexed ion was then held 
securely with no significant exchange with cobalt ions in the medium through- 
out the remaining steps to cobyrinic acid 58. Enzymic experiments on the 
insertion of were used in the third approach [125]. 

It was reported in Sect. 3.10 that both oxygens of the C-2 acetate of precorrin- 
3 A 55 are retained through all the biosynthetic steps in Ps. denitrificans leading 
to hydrogenobyrinic acid 60. This is in contrast to what had been found earlier 
for Pr. shermanii [126, 127]; when this organism incorporated [1-^^C,1,1,4- 
^®03]ALA 15c into vitamin B12 3, the C-2 acetamide group of the latter had 
undergone substantial loss of this loss was not experienced by any of the 
other carboxamide groups. It was further established by the same approach that 
already by the stage of cobyrinic acid 58 c, one oxygen atom had been lost from 
the C-2 acetate [128] and the other six carboxyl groups retained both oxygens. 
Scheme 31; the NMR spectra were determined using the corresponding hepta- 
methyl ester, cob ester. 

The first information about the fate of this lost oxygen came from studying 
the acetic acid, which was the isolated form (but see below) of the C2 fragment 
that is eliminated during the ring-contraction process [129, 130]. Cobyrinic 
acid 58 was biosynthesised from [5-^^C,l,l,4-^®03]ALA 15d, which ^^C-labels 
C-20 of precorrin-3A 55. The ^®0-induced shift of the ^^C-NMR signal given by 
the carbonyl carbon of the isolated acetic acid showed that it had received one 
from a carboxyl side-chain which must be that of the C-2 acetate [131]. It 
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Scheme 31. Labelling with '*0 and '^C to study the fate of the carboxyl oxygens 



follows that the oxygen of this C-2 acetate becomes bonded to C-20 in the ring- 
contraction process rather than to C-1 as happens in Ps. denitrificans, see 
Sect. 3.10. 

Evidence about the nature of this process has come from the isolation in a 
joint Stuttgart-Texas effort of a cobalto tetramethylated macrocycle from Pr. 
shermanii, shown by multiple ^^C-labelling to have structure 74, Scheme 32 
[132]. Presumably this compound, which is epimeric at C-8 relative to cobyrinic 
acid 58, is formed by enzymic methylation of 8-epi-precorrin-2 (8-epi 12) and 
the epimerisation caused derailment of the biosynthesis, allowing its isolation. 
The compound 74 is at the dehydro level of oxidation relative to what is needed 
for conversion forward into cobyrinic acid 58 and is considered to have been 
oxidised during isolation just as, for example, precorrin-2 12 was in the early 
studies. Its conversion into cobyrinic acid 58 thus requires both enzymic reduc- 
tion and epimerisation at C-8, which would explain the low observed level of 
incorporation (0.6%). 

A mechanism for the formation of 74 has been proposed [132] which has 
much in common with earlier ideas based on model studies by Eschenmoser 
and coworkers [119, 133, 134]. In addition, it is suggested that the dihydro 
form of 74 undergoes methylation at C-11, similar to that in Ps. denitrificans 
to yield 75. This could undergo a retro-aldol step to release acetaldehyde. 
Scheme 32. Indeed, when the 2-carbon fragment was immediately trapped, 
it was found to be acetaldehyde [135]; the earlier isolation of acetic acid re- 
sulted from oxidation of acetaldehyde by long incubation with the mixture of 
enzymes. 

The outcome of the retro-aldol step would be the compound 76 in Scheme 32, 
which is the cobalt complex of 73, ready for C-1 methylation (cf Scheme 28) to 
generate cobalto-precorrin-6 A 77. If this is correct, the pathways for biosynthetic 
modification of the organic macro cycles in the two organisms merge again at 76. 

This remerging of the chemistry in the two organisms had been supported by 
earlier studies [125] in which some of the intermediates isolated from Ps. 
denitrificans were incorporated into cobyrinic acid 58 by an enzyme prepara- 
tion from Pr. shermanii. Precorrin-6 A 61, precorrin-6B 63 and precorrin-8x 66, 
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Scheme 32. Proposed mechanism for the conversion of 74 into cobalto-precorrin-6 A 



Scheme 30, in doubly labelled form, were all incorporated into cobyrinic 

acid 58 with unchanged ratio. The incorporation levels (0.9-2. 7%) were 

reasonable bearing in mind that these three macrocycles need to sequester 
cobalt from the medium to enter the biosynthetic pathway in Pr. shermanii. 

In addition, it was proved [136] that an enzymic reduction which transfers a 
hydride equivalent to C-19 occurs as cobyrinic acid 58 is biosynthesised in Pr. 
shermanii. This finding fits perfectly with conversion of cobalto-precorrin-6A 
77 into cobalto-precorrin-6B (Co-63); the equivalent reductive step in Ps. deni- 
trificans was discussed in Sect. 3.6. Also, the stereospecificity of the transfer, 
employing 4-Hj{ of the reduced cofactor, was the same in both biological systems 
[136]. 

Thus, the picture at present is that the differences between Ps. denitrificans 
and Pr. shermanii are (a) cobalt is inserted late in the former and early in 
the latter and (b) the ring-contraction process involves intriguingly different 
chemistry around C-1, C-20 and the C-2 acetate side-chain in the two orga- 
nisms before the same organic macrocycles are used again. Of the 22 or so 
steps from ALA 15 to coenzyme 6^2 4, it already appears that some 16 involve 
either identical substances in the two organisms or they use the same organic 
ligand. The biosynthesis of coenzyme B12 in the two organisms is thus based 
on the same biosynthetic theme with two variations. This reinforces our 
thinking that the pathways used by the two organisms are quite close in evolu- 
tionary terms. 
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6 

Biosynthesis of Coenzyme Hydrogenobyrinic Acid 

Our plan was to focus this Chapter largely on the biosynthesis of hydrogeno- 
byrinic acid 60. Nevertheless, it is important to show even briefly how coenzyme 
Bi 2 4, Scheme 33, is constructed from 60. Besides the intrinsic value of this 
knowledge, it considerably helped the researches described in Sect. 3 because by 
detecting those genes acting beyond hydrogenobyrinic acid 60, the problem of pin- 
pointing the ones required to build the corrin macrocycle 60 was greatly simplified. 

Scheme 33 shows the steps elucidated [76] for the biosynthesis of coenzyme 
Bi 2 4 from hydrogenobyrinic acid 60 in Ps. denitrificans. This collects together 
the knowledge generated for this organism by the French teams of Blanche and 
Crouzet. A more complete review of this work and that of others is available [76]. 

The substrate for cobalt insertion was found to be the a,c-diamide 78 gener- 
ated from hydrogenobyrinic acid 60 by CobB. Remarkably, the superficially 
simple step of cobalt insertion requires the cooperation of three proteins, CobN, 
Cobs and CobT, to form a cobalt cheletase system which is ATP-dependent. The 
product, cob(II)yrinic acid a,c-diamide, is reduced to the Co(I) state by a flavin 
dependent enzyme (gene as yet unknown), which was isolated as pure protein. 
The highly nucleophilic Co(I)-complex was thus ready for adenosylation at 




Scheme 33. Complete pathway from hydrogenobyrinic acid to coenzyme B 12 
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cobalt by ATP catalysed by CobO to afford adenosylcobyrinic acid a,c-diamide 
79. Amidation of three of the propionate side-chains and the one remaining 
acetate side-chain is then carried out by CobQ to form adenosylcobyric acid 80 
leaving the fourth propionate, on ring D, available for attachment of the pro- 
panolamine unit. The latter is derived from threonine but, since no threonine 
decarboxylase has ever been found, a 3-step mechanism was considered in- 
volving enzymic dehydrogenation of threonine to 2-amino-3-ketobutyrate then 
decarboxylation followed by reduction to aminopropanol. Direct studies on Pr. 
shermanii [137] and Salmonella typhimurium [138] showed that this sequence 
is not followed, however. Fortunately, relief from the impasse has appeared in a 
recent report [139] that 0-phospho-L-threonine is involved in the process. The 
attachment of the propanolamine unit requires a protein designated a together 
with CobC and CobD to afford adenosylcobinamide 81. Then follows the ATP- 
dependent phosphorylation of the propanolamine hydroxyl group catalysed by 
CobP to yield 82. CobP is yet another multifunctional Cob enzyme in that it is 
also responsible for the transfer of a guanosyl monophosphate (GMP) unit from 
the triphosphate (GTP) to 82 affording adenosyl-GDP-cobinamide 83. 

A separate pathway has evolved to build a-ribazole 84, unusual both in its 
base, 5,6-dimethylbenzimidazole, and in the a-linkage to ribose. Either a- 
ribazole 84 or its 5'-phosphate 85 is able to accept the adenosylcobinamide 
phosphate residue from adenosyl-GDP-cobinamide 83, a transfer catalysed by 
CobV, to form adenosylcobalamin (coenzyme B 12 , 4) or its 5'-phosphate, re- 
spectively. Dephosphorylation of the 5'-phosphate then leads to 4. Thus the end 
product from nature’s marathon, coenzyme B 12 4, has been reached, a structure 
of marvellous architecture and amazing biological activity. 

Before the foregoing researches on Ps. denitrificans, little was known in any 
organism about the enzymes involved in building the nucleotide loop of 4 and 
structural studies on intermediates were carried out on materials lacking the 
adenosyl group. Nevertheless, early studies on Pr. shermanii [ 140] led to the view 
that the true intermediates were probably all adenosylated. This has received 
direct support by isolation of the adenosylated corrin diamide 79 from Pr. 
shermanii [141]. The triamide that follows 79 on the pathway was also isolated 
from this organism [141] and, though the site of the third amidation is un- 
known, this intermediate was identical to the triamide biosynthesised by Ps. 
denitrificans [ 142]. The combined evidence thus shows that much of the late part 
of the biosynthetic pathway in the microaerophilic organism Pr. shermanii flows 
in exactly the same way as in the aerobic Ps. denitrificans, that is 79 ^ 80 ^ 
81 — ^ 82 — ^ 83 — ^ 4, Scheme 33. 

7 

Summary and Conclusions 

Our review focused first on the tetrapyrrolic macrocycle, uro’gen III 10 which 
stands at that point in the biosynthetic process where the pathway to coenzyme Bj 2 
branches away from those to haem and chlorophyll. The present position is that the 
route from ALA 15 to uro’gen III 10 has been elucidated in considerable detail and 
substantial knowledge has been gained about the mechanisms of the processes 
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involved. The enzymes that catalyse these reactions are now available in pure state 
and, with the identification of their genes, have been overproduced, so opening the 
door to crystallisation of the protein and X-ray structure analysis. There will sure- 
ly be further progress in structural work on these enzymes and one particularly 
looks forward to the day when the active site of uro’gen III synthase can be seen. 
Further, the Laue X-ray approach for the study of at least one of these enzymes in 
action as it binds and manipulates the substrate offers exciting opportunities. 

Moving forward from uro’gen III, this review showed that there has been a 
transformation in knowledge of the biosynthesis of vitamin B 12 3 and coenzyme 
Bi 2 4; progress has been faster and more complete than anyone in the field dared 
to imagine. 

In Ps. denitrificans, the aerobic organism used for most of these recent re- 
searches, the conversion of ALA 15 through to coenzyme B 12 4 involves 22 steps 
and this entire pathway has been fully elucidated. The enzymes that catalyse 
essentially every step have been identified together with the corresponding 
genes. In the majority of cases, the genes have been overexpressed and the 
enzymes isolated, purified and characterised. Complementary to this biological 
research has been the chemical effort which has led to the elucidation of the 
structures of all the intermediates on the pathway and, starting with uro’gen III, 
there are 19 main intermediates leading to coenzyme B 12 4. Also, the mechanism 
and stereochemistry of a number of the important reactions involved in pro- 
ducing and transforming these intermediates have been worked out. However, 
the impression must not be given that the biological and chemical researches 
were in any way separate; they were tightly interlocked and each was essential 
for the other. Indeed, it was the synergistic combination of genetics, molecular 
biology, enzymology, synthetic and structural chemistry, isotopic labelling and 
NMR spectroscopy that led to the dramatic progress in solving the problem of 
Bi 2 -biosynthesis in Ps. denitrificans. 

Much is also known about the route from ALA to coenzyme B 12 in the micro- 
aerophilic organism Pr. shermanii. Of those steps that have been studied so 
far, a substantial majority occur using either identical intermediates to those 
established in Ps. denitrificans or they involve the cobalt complexes of the same 
organic ligands. There are differences, however, in that cobalt is inserted far 
earlier on the pathway in Pr. shermanii and an interesting modified mechanism 
is used for the ring contraction in this micro-aerophilic system. 

The final message is that the researches reviewed demonstrate the enormous 
power of the tools that are now available to biologists and bio-organic chemists for 
discovering how even the most complex substances are built by living systems. 
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